This Readme file should be obsolete unless you have not updated to version 1.1.x . See Read ME OOo 1.1
en_BE.zip readooo.html version 0.1.0 January 10, 2004
This is an OpenOffice.org (OOo) (word processor suite), spell-check wordlist
in proper form that has been made by deleting the non-Basic words from
the regular OOo dictionary or spell-check wordlist. This spell-check list was then
tested on most of Ogden's writings that he says are written in Basic English.
Files contained: The ZIP file will automatically download, but you will need
software to UNZIP it.
Many names of people, first and last names, have been removed to get
the file size down while retaining many capitalized proper nouns. Some of these
may be added back as miscellaneous corrections are made in future versions.
See toddlist.txt for a text file
or spellist.xls in Excel, each is without proper nouns and is a
more strict use of derivatives.
If you don't want to use the ZIP form, click on any of the filenames below and
SAVE AS to your disk.
This is a spell checking word list for use with OpenOffice.org software
as a Basic English spell checker.
This list is the default OpenOffice.org spell checking file
with the non-Basic words and derivatives removed. Capitalized words
are retained as proper nouns, except a bunch of person names were
removed to make the file more managable. Many of these will be added
back as miscellaneous corrections are made.
See "toddlist" for a complete Basic English wordlist that works
with OpenOffice.org software that is without proper nouns.
Open Office.org uses a spell check file named en_US.dic. We will substitute our file called
en_BE.dic and rename it to en_US.dic so that it will be recognised by Open Office.
Go to folder: (your path)/openoffice/user/wordbook/
This is what changes in OOo 1.1
|First, make a backup copy of the original en_US.dic: |
|We suggest changing the original filename from : ||en_US.dic ||to : ||en_US.dicori.|
|Copy the new file : ||en_BE.dic ||into: ||(yourpath)/openoffice/user/wordbook/|
|Rename the new file : ||en_BE.dic ||to : ||en_US.dic.|
|That is all there is.|
| Rename : ||en_US.dic ||back to: en_BE.dic|
| Rename : ||en_US.dicori ||back to: en_US.dic|
Details: - More than you need to know:
The OpenOffice.org affix file has been reduced from 22 options available for OOo to
Basic English's simpler options. There is no specific need to replace the original en_US.aff file
except for some minor efficiency with the smaller file. This affix file of standard prefix and suffix
allows the word list to contain a base word coded with legitimate prefixes and suffixes.
This allows the word list to be smaller. The original affix file works fine with Basic English because illegal suffixes are not coded in the Basic English wordlist. If you want to substitute the Basic English affix file, follow the same instructions as for substituting the wordlist.
Inside the file.
The number at the top is the word count. This saves the system
from having to do two passes thru the file. Therefore when you add
your name and town to the list, you will want to increase the
"TRY esianrtolcdugmphbyfvkw" in the affix file is the
the alphabetic search frequency list.
While working with the OOo wordlist (dictionary), several hundred
Basic English derivatives were found and added to the developers
spreadsheet and to toddlist. This is to point out that these are
preliminary files. However, testing the spell checker against Ogden's
works only found a few dozen corrections.
Spell checking software often makes use of "affix" files and an
algorithm to add prefix and suffix forms. The OpenOffice.org
affix file currently has 22 affixes defined. Ogden's Basic English
will make use of 9 of these. Ogden allows the use of -est for single
syllable words, which might entail a 10th. Affix files have some
idiosyncrasies; for example, re- is one of seven prefix options and
has a code value of option A. The word "read" is coded as ad/A.
This means that manual preparation of files will contain human errors.
There are programs to create lists of these files, called, "munch" and "unmunch,"
that use Unix utilities in the OpenOffice.org project.
OpenOffice.org is supposed to allow use of multiple languages, but
some correspondence indicates that feature is not working yet.
Basic English desires to use this feature to allow each user to
select additional specialty and detail word lists to include.
The standard language code is: a 2-character, lower case letters
to indicate the language. "en" means English; an underscore; and
2-capital letters indicating the language as spoken in a country.
US, BR and CA are USA, United Kingdom, and Canada. Thus en_US
is American English.
"be" language is Bellarussian.
"BE" country is Belgium.
As Brussels is the capital of the common market, the use of
that symbol for "Basic English: International Second Language" might
not be inappropriate. Which is why we use that name here.
Notes about OOo.
Somebody owns the word "OfficeOffice" so the software must be called
OpenOffice.org or OOo . Wonder what the story there is?
OOo calls a spell-checking wordlist, a dictionary.
They call a translation table, a thesaurus or synonym list.
An OOo dictionary, .dic, file is a text file saved as with as OpenOffice
word processor as "text encoded", with LF, without CR. (It saves one
carriage return per word, which is about 10%.) Saving as a regular
text filewith any text processor will work fine in case you want to play with the list.
Just be sure to rename to .dic before using.
OpenOffice QuickStart must be "off" to recognize new dictionaries
or affix files. Otherwise it is not recognized until the next system restart.
This list is not yet approved by the OpenOffice.org project.
After we get the words stabilized and in the proper form, the
Basic English language will be submitted to the OOo Project for consideration.
"TRY esianrtolcdugmphbyfvkw" is in the affix file header.
This is the alphabetic search frequency.
The one included here is provided by OpenOffice.org.
This frequency list differs from the one we learned for ciphers :
"etaonrishdlcm . . ."
Checking some web resources, we find a variety, but they all differ
markedly from the OOo version. Perhaps they have customized for
the specific affix application.
etainoshrdlucmfwygpbvkqjxz Samuel F B Morse
The OpenOffice.org default letter frequency is suitable to translate
standard English text. A new one may have to be developed for
Basic English, sometime, that is customized for spell checking text
that is already in Basic English.
Sample of the data:
The affix options used after a slash for Basic English are: (See OOo for complete instructions.)
About this Page: readooo.html -- Readme file of a Basic
English version of OpenOffice.org spellcheck wordlist (dictionary).
Last updated January 10, 2004.