SIMPLE (ENGLISH) WIKIPEDIA Wordlist for Spell Checking
Draft -- this note is in transition between an earlier Wiki
readme for developers and a simplified version with only one main file for use by Wiki writers. This Wiki note will look similar to the ones for Basic English and for Simple English. The procedures are the same; only the word list vocabularies differ -- VOA Special English is added to make the WIKI version. A couple of file names will change, but everything else is the same. If this draft is not clear, then the instructions in those files may be helpful.
Simple Wikipedia English [called WIKI herein] has no firm definition, but it is generally regarded as Basic English (so as to be able to say anything), plus the 1000 most frequent English words not already in Basic (so as to give fluidity), plus VOA Special English (because of its wide access for beginners). The files needed to install Simple WIKI have been provided for download for
use in the OpenOffice.org word processing suite (and more), wherein the non-WIKI words are
highlighted as if misspelled.
Also included is a thesaurus feature that will offer a drop down list of suggested wording for translation from full English to Basic English.
Purpose:
Provide a spell checking filter for writing Simple WIKI English, for use with the HunSpell (MySpell) software
that is most notably used by the free office suite OpenOffice.org. The recommended vocabulary of
Simple WIKI English is Basic English at the "next step" level, plus
the most frequent words in English, plus VOA Special English.
Files included:
a. readwiki.html 12B (this note)
b. WIKI.dic -- 4760 root words. 50 KB
Consists of Basic 1500, plus Frequent 1000 (adds 680 words) plus VOA-SE (adds 450 words).
Does not include proper nouns (capital letter words, 162KB)
c. WIKI.aff 2KB An "affix" includes both prefixes and suffixes. This is a feature of the software that adds some efficiency. It is the same as en_US.aff. You need not be concerned with it.
d. dictionary.lst (this lists the dictionaries that are available to OpenOffice.org; it replaces the dictionary.lst that comes with OOo and allows separate languages for Basic, Simple English, and WIKI).
e. th_en_BE.dat 905KB , drop down list of suggestions for Basic English wording.
f . th_en_BE.idx 357KB , an index for thesaurus.
Optional
g. readwiki.html 17KB , this note.
h. readwikimore.html 15KB , for language developers, might be included
i. capital.dic 169KB , optional word list
TO INSTALL
1 . Download the Simple WIKI English spell checking dictionary from the Basic English Institute download page.
a . Download : WIKI.zip
Unzip it to some temporary folder. It will contain the files listed above (items b through f),
plus this note, plus, maybe, another note with more detailed information for
developers and an optional Capitalized word list.
b . Copy the five files (items b through f) to the OOo dictionary folder.
It may be on path : C:\Program Files\OpenOffice.org 2.4\share\dict\ooo\
If it is not there on your system, then search for "en_US.dic" and add the five files to that same folder.
Note : "dictionary.lst" will already exist -- if asked to replace
the old file, answer "Yes".
2 . Add WIKI language to OpenOffice.
Start OpenOffice.org writer.
a . On the taskbar at the top of the page,
select Tools
then Options, near the bottom
then Language Setting, near the middle
then Writing Aids
In the User-defined Dictionaries box in the middle
Select New ...
Dictionary Name -- enter : WIKI
Language -- select from list : English(Ghana)
Close this box.
Look at Options box on the bottom of that page and be sure that
these are NOT marked.
[ ] "Check in all languages"
[ ] "Do not mark errors".
Click OK
b . Again, on the taskbar at the top of the page
select Tools
then Options at the very bottom of the list
then Language Setting
then Languages
Under "Default languages for documents" | Western
select : English(Ghana)
click "OK"
That is all that is needed
Exit OpenOffice.org
Exit OOo QuickStart icon, if it is there.
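Step 1b above (find the folder that holds "en_US.dic", then copy the five files into it) can also be done with a short script. This is only a sketch: the function names find_dict_folder and install_files are made up for this note, and it assumes the zip has already been unpacked to a folder of your choosing.

```python
import os
import shutil

# The five files named in step 1b of this note.
WIKI_FILES = ["WIKI.dic", "WIKI.aff", "dictionary.lst",
              "th_en_BE.dat", "th_en_BE.idx"]


def find_dict_folder(search_root, marker="en_US.dic"):
    """Return the first folder under search_root that contains marker, or None."""
    for folder, _subfolders, files in os.walk(search_root):
        if marker in files:
            return folder
    return None


def install_files(unzipped_folder, dict_folder, names=WIKI_FILES):
    """Copy each named file into dict_folder, replacing any existing copy.

    Returns the list of names actually copied; missing files are skipped.
    """
    copied = []
    for name in names:
        source = os.path.join(unzipped_folder, name)
        if os.path.isfile(source):
            shutil.copy(source, os.path.join(dict_folder, name))
            copied.append(name)
    return copied
```

For example, find_dict_folder(r"C:\Program Files") would search below that folder, and its result can be passed to install_files as dict_folder. Because the old dictionary.lst is overwritten, you may wish to keep a copy of it first.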
To USE:
Restart OpenOffice.org and try it.
Any new writing in OOo will be spell checked for Simple WIKI English. Type a few words.
Note that you can change languages for different documents. You can even change the spell checking language for this document, or for a paragraph within this document.
For example, you might spell check in full English to be sure of your spellings, then switch to WIKI English to see if there are any words that are not in the WIKI vocabulary.
Look at :
Tools , on the tool bar at top
Language , second line.
For all Text ...
More ...
Select English(Ghana) or whatever language you wish.
TO UNINSTALL :
There are no registry entries. Simply delete or don't use any features
that are no longer wanted.
FUN STUFF
Using OpenOffice.org Writer,
Open file (path/)WIKI.dic.
Add your name, town, street, etc., one word per line.
Sort alphabetically.
Change the line count in the first line to the new number.
Save
Close OpenOffice.org - also close QuickStart if it is there.
Restart OOo.
Test using your name, town.
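The same steps (add words, sort, change the line count) can be sketched in a few lines of Python instead of doing them by hand in Writer. The function name add_words is made up for this note, and the sketch assumes the layout described here: the first line is the count and every later line is one entry.

```python
def add_words(dic_text, new_words):
    """Return .dic text with new_words added, sorted, and the count updated.

    Assumes the first line is the word count and each later line is one entry.
    """
    lines = dic_text.splitlines()
    # Skip the old count line; a set also drops any duplicate entries.
    entries = {line.strip() for line in lines[1:] if line.strip()}
    entries.update(word.strip() for word in new_words if word.strip())
    # Sort without regard to case so that names mix in with ordinary words.
    ordered = sorted(entries, key=lambda entry: (entry.lower(), entry))
    return "\n".join([str(len(ordered))] + ordered) + "\n"
```

To use it, read the text of WIKI.dic, pass it through add_words with your own list of names, and save the result back under the same file name. Then close and restart OOo as described above.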
TROUBLE SHOOTING. -- incomplete.
Reread the installation instructions.
Pay attention to spelling.
Language Tools:
Options ... etc.
Language has a check mark.
Look at dictionary.lst
C:\Program Files\OpenOffice.org 2.0\share\dict\ooo\dictionary.lst
Confirm this line exists:
English (Ghana) will now be recognized as a language with spell checking capabilities.
Configure the OOo text processor to recognize the language "English (Ghana)" as either "Default"
or "For the current document only."
... (more) ...
Exit OOo QuickStart and re-start OOo.
OpenOffice QuickStarter needs to be turned "off" only once after such changes, so that new dictionaries or affix files are recognized. QuickStarter is no longer useful for OOo 2.4 and higher and may be permanently turned off.
DISCUSSIONS
Basic 1500
Every learner of Basic English is expected to know the 850 words,
the international words, six affixes, and complex words, plus
one area of General interest with 100 words, such as Science, Business,
or Verse, and one Specialty within that general topic with an additional 50 words,
such as Biology, Economics, or Bible -- the Specialty words are NOT included in WIKI.
Basic English is a full language for general living
and for working as an auxiliary international language. It is good English, just simple. The limited vocabulary allows
quick learning -- weeks, not years. Obviously it is an excellent first step in learning
full English because it allows almost immediate immersion into daily English-speaking life.
Note, Basic English is a subset of Standard English with simple rules of grammar -- there is
NO unlearning required to progress to full English. The originators of Basic also provide
a learning path beyond basic Basic (adding 150 Next Step words and 350 Subsequent words) at
which point the learner should be able to continue at his own pace. Because Simple WIKI English comes after Basic English, the expanded, "next step" Basic is included.
For Simple WIKI use we have included the general subject words, but NOT
the Specialty lists of Basic. We included these, plus the "next step" words that lead
from Basic towards full English. This combined list is sometimes referred to as the Basic 1500.
The First Supplement of 150 words for common
foods, plants, and animals has been lost. If found, they will be added; otherwise a good guess will be provided in the WIKI vocabulary.
There is much overlap between the three sources. For example,
98 of the 100 most frequent words are already in Basic, and
half of the VOA-SE words are also Basic words. We have attempted to remove duplicates.
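The idea of each source "adding" only the words not already present can be shown with a small sketch. The function name merge_sources and the sample words are made up for illustration; they are not the real vocabularies, and this is not necessarily how the WIKI list was built.

```python
def merge_sources(sources):
    """Merge ordered (name, words) pairs, skipping words already seen.

    Returns the merged word list and, per source, how many new words it added.
    """
    seen = set()
    merged = []
    added = {}
    for name, words in sources:
        count = 0
        for word in words:
            key = word.lower()          # compare without regard to case
            if key not in seen:
                seen.add(key)
                merged.append(word)
                count += 1
        added[name] = count
    return merged, added


# Tiny made-up samples, not the real vocabularies.
sample = [
    ("Basic", ["go", "come", "water", "book"]),
    ("Frequent", ["go", "the", "water", "people"]),
    ("VOA", ["book", "people", "agency"]),
]
merged, added = merge_sources(sample)
print(added)
```

The order of the sources matters: a word is credited to the first list that holds it, which is why the later lists add fewer words than their full size.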
Capitalized Words
Please note that all proper nouns (capitalized words) are allowed in Basic English. The capitalized word feature may or may not yet be built into this version of WIKI; if it is not, most capitalized words will show as misspelled. Just disregard the marks, but you will have to check those spellings yourself.
The following will be moved to readwikimore.html sometime. Once there it will be edited better than what is here.
If you are interested in more information, what follows is a longer version of what
you have just done; it tells why you did each step and what is going on inside the program.
It is for language developers and those needing a different definition of vocabulary for Simple.wikipedia.org.
A full range of Basic, special word lists, frequency, VOA, and capitalized word files may be provided for a Simple WIKI English Developer. Comment within SimpleWiki is desired, and perhaps a standard definition of the SimpleWiki vocabulary may be developed. Until such time as
some standards are established, we offer Simple WIKI English writers a spell check and translation of non-Simple words into Basic English (not into the wider WIKI Simple English).
Capitalized Words
The capitalization dictionary is included in this download, but NOT included in the WIKI word list -- maybe next month. HOWEVER, you can do it yourself -- UNZIP the capitalized word list ; concatenate capital.dic with WIKI.dic ; sort ; change the line count at the top ; save.
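The do-it-yourself steps above (concatenate, sort, change the line count, save) might look like this as a script. It is a sketch only: the function names are made up for this note, and the "latin-1" encoding is an assumption; check the SET line near the top of the .aff file for the encoding your files really use.

```python
def read_entries(path, encoding="latin-1"):
    """Return the entries of a .dic file, skipping its first (count) line."""
    with open(path, encoding=encoding) as handle:
        lines = handle.read().splitlines()
    return [line.strip() for line in lines[1:] if line.strip()]


def merge_dic_files(wiki_path, capital_path, out_path, encoding="latin-1"):
    """Write one sorted .dic file holding the entries of both inputs.

    Returns the number of entries written, which is also the new first line.
    """
    entries = sorted(set(read_entries(wiki_path, encoding))
                     | set(read_entries(capital_path, encoding)))
    with open(out_path, "w", encoding=encoding, newline="\n") as handle:
        handle.write("\n".join([str(len(entries))] + entries) + "\n")
    return len(entries)
```

Writing to a new file name first, and only then replacing WIKI.dic, keeps the original safe if something goes wrong. The plain sort used here puts capitalized entries before lower-case ones.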
More than you may want to know.
See page Read Wiki More
Long Version of Installation with some explanations.
It may have a path something like this :
C:\Program Files\OpenOffice.org 2.0\share\dict\ooo\dictionary.lst
Add or confirm this line :
English (Ghana) will now be recognized as a language with spell checking capabilities.
Configure the OOo text processor to recognize the language "English (Ghana)" as either "Default"
or "For the current document only."
Dictionary Details:
Note : Many people new to simple languages are surprised that a few root words multiply into many times their number of spellings and senses. Learning the Basic 850 results in over 5000 simple derivatives and compound words.
Example : "equal" becomes equaled, equaler, equaling, equally, equals, unequal, unequaled, unequally.
Common words making complex words are : -able, -full ; any-, out-, over-, short-, side-, some-, under-, up/upper- , work-.
Complex words have not been added for Frequent and VOA words
to this trial dictionary. VOA is silent on rules for derivatives -- we have used Basic English rules.
Note : Filenames ending in .aff, .dic, and .txt are simple text files that can be read/edited with any simple text editor.
The number at the top is the word count. This makes the program work more efficiently. Therefore, when you add your name, town, etc. to the list, you will want to increase the word count.
Affix file.
Spell checking software often makes use of "affix" files and an
algorithm to add prefix and suffix forms to the root word. You do not
need to be concerned about the affix file.
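For the curious, the general idea of building forms from a root word can be shown with a toy sketch. The flag letters and rules below are made up for this note; they are NOT the rules in WIKI.aff or en_US.aff, and real affix rules also carry conditions and letter changes that are left out here.

```python
# Hypothetical flag table: flag letter -> (kind, text). Not the real en_US.aff.
RULES = {
    "U": ("prefix", "un"),
    "D": ("suffix", "ed"),
    "R": ("suffix", "er"),
    "G": ("suffix", "ing"),
    "Y": ("suffix", "ly"),
    "S": ("suffix", "s"),
}


def expand(entry, rules=RULES):
    """Expand a 'root/FLAGS' entry into the root plus one form per flag."""
    root, _, flags = entry.partition("/")
    forms = [root]
    for flag in flags:
        kind, text = rules[flag]
        # A prefix goes in front of the root, a suffix goes after it.
        forms.append(text + root if kind == "prefix" else root + text)
    return forms


print(expand("equal/UDRGYS"))
```

One root word with a few flags stands in for many spellings, which is why the word list can stay short. This toy version does not combine a prefix with a suffix (for example "unequally"), which a real spell checker can do.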
The name WIKI is pre-set in the dictionary.lst file as a country dialect of
English.
It is preset as: en GH WIKI
You will want to change this to : en GH WIKI
en GH indicates that a document whose language is English with a country dialect of Ghana will use WIKI as the name of the spellcheck dictionary and affix file.
Note : English (Belize) is currently used for pure Basic, English (Jamaica) for Basic 1500, and English (Zimbabwe) for Simple English (without VOA SE). Basic will be used by the most skilled Simple Wiki writers. ;^)
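How an entry such as "en GH WIKI" maps to file names can be sketched as below. The function name parse_entry is made up for this note. The sketch reads only the last three fields of the line, on the assumption that a real dictionary.lst line may begin with a type keyword in front of them; check your own file to be sure.

```python
def parse_entry(line):
    """Split a dictionary.lst entry into language, country, and file names.

    Takes the last three whitespace-separated fields, so a leading type
    keyword (an assumption about the full file format) is tolerated.
    """
    fields = line.split()
    if len(fields) < 3:
        raise ValueError("expected at least: language country name")
    language, country, name = fields[-3:]
    return {
        "language": language,
        "country": country,
        "dic_file": name + ".dic",   # the word list
        "aff_file": name + ".aff",   # the matching affix file
    }


print(parse_entry("en GH WIKI"))
```

This shows why WIKI.dic and WIKI.aff must both be present and must share the same name as the last field of the entry.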
dictionary.lst file.
A file suitable for Basic English, Next Step Basic, Simple English
and simple WIKI usage is provided to replace the file that came with OOo.
The additional features of Hyphenation and Thesaurus are pre-set
to use full English. Commonwealth users may prefer to change these from US to GB.
Note : Spelling in all files is American first, with the most useful British
spellings included. Wikipedia writers can use either, but try to be consistent within
any one page.
Notes about OpenOffice.org
Download OpenOffice.org,
it is freeware and a large file (96MB), or order it as a CD from one of their
partners.
We paid $5.50 for a copy.
Somebody owns the word "OpenOffice" so the software must be called
OpenOffice.org. Wonder what the story there is?
OOo gives spell-checking word lists the name of dictionary , xxx.dic.
They give a translation table the name of thesaurus or synonym list.
An OOo dictionary (.dic) file is a simple text file saved
with OpenOffice Writer as text, but with the name-end of .dic . A "techy type" will want to know that if saved as "text encoded", then LF is required, but not CR. (It saves one
carriage return per word, hardly worth it.) Saving as a regular text
file will work fine and is easier to work with for additions and changes.
OpenOffice QuickStarter must be "off" for new or changed dictionaries
or affix files to be recognized. QuickStarter is no longer useful for OOo 2.4 and higher and may be permanently turned off.
About this Page: readwiki.html -- Installation instructions for writing aids, spellchecking word list for Simple Wiki using HunSpell (MySpell) software, specifically for use with OpenOffice.org.
Last updated : May 21, 2008. Separate WIKI in dictionary.lst ; add thesaurus ; adjust readwiki.
May 8, 2008 . Replace Basic850 with Basic1500.
May 3, 2008 . Simplify to one main word list and without Capitals.
Created : January 14, 2005. Plan of aids for Simple English / Wikipedia
URL: http://www.basic-english.org/down/readwiki.html
LINKS : Simple English Wiki