Forum Moderators: phranque

Message Too Old, No Replies

plaintext list of all english words

usr/dict/words a little too extensive

         

monie

9:32 am on Jan 14, 2010 (gmt 0)

10+ Year Member



Hi,
I'm in the process of creating a game in which users try to build compound words. I wrote a program that, given an alphabetized list of words, can find all the combos that make a compound word.

My problem is that usr/dict/words contains prefixes, shorthand, acronyms, and regional words ("bo", "ay", "er", "sh") so the program is defining lots of things as compound words when it shouldn't. (E.g. it thinks snacker, or "snack"+"er", is a compound word.)

All the plaintext dictionaries I've found are based on project Gutenberg and have the same problem: as dictionaries, they were meant so people could, knowing a word, get its definition. They weren't intended to be used for programmers to determine whether a given string is "valid". So if it's debatable whether something is a word at all, they include it.

Is there any list of words out there that contains all "real" English words? All words which the Mac spell-checker accepts would be perfect.

monie

12:45 pm on Jan 14, 2010 (gmt 0)

10+ Year Member



Answered my own question.

Can I post the link for others if I didn't make it/have no connection to the creator?