SUPPORT THE WORK

GetWiki

spell checker

ARTICLE SUBJECTS
aesthetics  →
being  →
complexity  →
database  →
enterprise  →
ethics  →
fiction  →
history  →
internet  →
knowledge  →
language  →
licensing  →
linux  →
logic  →
method  →
news  →
perception  →
philosophy  →
policy  →
purpose  →
religion  →
science  →
sociology  →
software  →
truth  →
unix  →
wiki  →
ARTICLE TYPES
essay  →
feed  →
help  →
system  →
wiki  →
ARTICLE ORIGINS
critical  →
discussion  →
forked  →
imported  →
original  →
spell checker
[ temporary import ]
please note:
- the content below is remote from Wikipedia
- it has been imported raw for GetWiki
In software, a spell checker (or spell check) is a software feature that checks for misspellings in a text. Features are often in software, such as a word processor, email client, electronic dictionary, or search engine.{{image frame| width = 252| content =Eye have a spelling chequer,It came with my Pea Sea.It plane lee marks four my revueMiss Steaks I can knot sea.Eye strike the quays and type a whirred And weight four it two say Weather eye am write oar wrong It tells me straight a weigh.Eye ran this poem threw it,Your shore real glad two no.Its vary polished in its weigh.My chequer tolled me sew.A chequer is a bless thing,It freeze yew lodes of thyme.It helps me right all stiles of righting,And aides me when eye rime.Each frays come posed up on my screenEye trussed too bee a joule.The chequer pours o'er every wordTwo cheque sum spelling rule.|caption =The original version of this poem was written by Jerrold H. Zar in 1992. An unsophisticated spell checker will find little or no fault with this poem because it checks words in isolation. A more sophisticated spell checker will make use of a language model to consider the context in which a word occurs.}}File:Spell checker.jpg|thumb|Google ChromeGoogle Chrome

Design

A basic spell checker carries out the following processes:
  • It scans the text and extracts the words contained in it.
  • It then compares each word with a known list of correctly spelled words (i.e. a dictionary). This might contain just a list of words, or it might also contain additional information, such as hyphenation points or lexical and grammatical attributes.
  • An additional step is a language-dependent algorithm for handling morphology. Even for a lightly inflected language like English, the spell-checker will need to consider different forms of the same word, such as plurals, verbal forms, contractions, and possessives. For many other languages, such as those featuring agglutination and more complex declension and conjugation, this part of the process is more complicated.
It is unclear whether morphological analysis—allowing for many different forms of a word depending on its grammatical role—provides a significant benefit for English, though its benefits for highly synthetic languages such as German, Hungarian or Turkish are clear.As an adjunct to these components, the program's user interface will allow users to approve or reject replacements and modify the program's operation.An alternative type of spell checker uses solely statistical information, such as n-grams, to recognize errors instead of correctly-spelled words. This approach usually requires a lot of effort to obtain sufficient statistical information. Key advantages include needing less runtime storage and the ability to correct errors in words that are not included in a dictionary.U.S. Patent 6618697, Method for rule-based correction of spelling and grammar errors In some cases spell checkers use a fixed list of misspellings and suggestions for those misspellings; this less flexible approach is often used in paper-based correction methods, such as the see also entries of encyclopedias.Clustering algorithms have also been used for spell checkingde Amorim, R.C.; Zampieri, M. (2013) Effective Spell Checking Methods Using Clustering Algorithms. Proceedings of Recent Advances in Natural Language Processing (RANLP2013). Hissar, Bulgaria. p. 172-178. combined with phonetic information.Zampieri, M.; de Amorim, R.C. (2014) Between Sound and Spelling: Combining Phonetics and Clustering Algorithms to Improve Target Word Recovery. Proceedings of the 9th International Conference on Natural Language Processing (PolTAL). Lecture Notes in Computer Science (LNCS). Springer. p. 438-449.

History

{{refimprove|section|date=January 2019}}In 1961, Les Earnest, who headed the research on this budding technology, saw it necessary to include the first spell checker that accessed a list of 10,000 acceptable words.WEB, Earnest, Les, The First Three Spelling Checkers,weblink Stanford University, 10 October 2011, yes,weblink" title="web.archive.org/web/20121022091418weblink">weblink 22 October 2012, Ralph Gorin, a graduate student under Earnest at the time, created the first true spelling checker program written as an applications program (rather than research) for general English text: SPELL for the DEC PDP-10 at Stanford University's Artificial Intelligence Laboratory, in February 1971.BOOK, Peterson, James, Computer Programs for Detecting and Correcting Spelling Errors, Dec 1980,weblink 2011-02-18, Gorin wrote SPELL in assembly language, for faster action; he made the first spelling corrector by searching the word list for plausible correct spellings that differ by a single letter or adjacent letter transpositions and presenting them to the user. Gorin made SPELL publicly accessible, as was done with most SAIL (Stanford Artificial Intelligence Laboratory) programs, and it soon spread around the world via the new ARPAnet, about ten years before personal computers came into general use.BOOK, Earnest, Les, Visible Legacies for Y3K,weblink 2011-02-18, yes,weblink" title="web.archive.org/web/20110720044806weblink">weblink 2011-07-20, SPELL, its algorithms and data structures inspired the Unix ispell program.
The first spell checkers were widely available on mainframe computers in the late 1970s. A group of six linguists from Georgetown University developed the first spell-check system for the IBM corporation.WEB,weblink Georgetown U Faculty & Staff: The Center for Language, Education & Development, 2008-12-18, yes,weblink" title="web.archive.org/web/20090205140452weblink">weblink 2009-02-05, , citation: "Maria Mariani... was one of a group of six linguists from Georgetown University who developed the first spell-check system for the IBM corporation." Henry Kučera invented one for the VAX machines of Digital Equipment Corp in 1981.NEWS, Teaching Computers to Spell (obituary for Henry Kučera), Harvey, Charlotte Bruce, May-June 2010, Brown Alumni Magazine, 79, The first spell checkers for personal computers appeared in 1980, such as "WordCheck" for Commodore systems which was released in late 1980 in time for advertisements to go to print in January 1981.NEWS,weblink Micro Computer Industries, Ltd., Compute! Magazine, Issue 8, Vol. 3, No. 1, January 1981, Advertisement, 119, Developers such as Maria Mariani and Random HouseNEWS,weblink The Spelling Bee Is Over, PC Magazine, November 1982, 21 October 2013, Advertisement, 165, rushed OEM packages or end-user products into the rapidly expanding software market, primarily for the PC but also for Apple Macintosh, VAX, and Unix.{{cn|date=January 2019}} On the PCs, these spell checkers were standalone programs, many of which could be run in TSR mode from within word-processing packages on PCs with sufficient memory.However, the market for standalone packages was short-lived, as by the mid-1980s developers of popular word-processing packages like WordStar and WordPerfect had incorporated spell checkers in their packages, mostly licensed from the above companies, who quickly expanded support from just English to European and eventually even Asian languages. However, this required increasing sophistication in the morphology routines of the software, particularly with regard to heavily-agglutinative languages like Hungarian and Finnish. Although the size of the word-processing market in a country like Iceland might not have justified the investment of implementing a spell checker, companies like WordPerfect nonetheless strove to localize their software for as many national markets as possible as part of their global marketing strategy.Firefox 2.0, a web browser, has spell check support for user-written content, such as when editing Wikitext, writing on many webmail sites, blogs, and social networking websites. The web browsers Google Chrome, Konqueror, and Opera, the email client Kmail and the instant messaging client Pidgin also offer spell checking support, transparently using previously GNU Aspell and currently Hunspell as their engine. Mac OS X now has spell check system-wide, extending the service to virtually all bundled and third party applications.Some spell checkers have separate support for medical dictionaries to help prevent medical errors.WEB, 2017,weblink Medical Spell Checker for Firefox and Thunderbird, e-MedTools, 2018-08-29, WEB, Quathamer, Dr. Tobias, 2016,weblink German medical dictionary words, Dr. Tobias Quathamer, 2018-08-29, WEB, Friedman, Richard A., D, M, 2003,weblink CASES; Do Spelling and Penmanship Count? In Medicine, You Bet, The New York Times, 2018-08-29,

Functionality

The first spell checkers were "verifiers" instead of "correctors." They offered no suggestions for incorrectly spelled words. This was helpful for typos but it was not so helpful for logical or phonetic errors. The challenge the developers faced was the difficulty in offering useful suggestions for misspelled words. This requires reducing words to a skeletal form and applying pattern-matching algorithms.It might seem logical that where spell-checking dictionaries are concerned, "the bigger, the better," so that correct words are not marked as incorrect. In practice, however, an optimal size for English appears to be around 90,000 entries. If there are more than this, incorrectly spelled words may be skipped because they are mistaken for others. For example, a linguist might determine on the basis of corpus linguistics that the word baht is more frequently a misspelling of bath or bat than a reference to the Thai currency. Hence, it would typically be more useful if a few people who write about Thai currency were slightly inconvenienced than if the spelling errors of the many more people who discuss baths were overlooked.
missing image!
- Spell check.PNG -
A screenshot of Enchant, the AbiWord spell checker.
The first MS-DOS spell checkers were mostly used in proofing mode from within word processing packages. After preparing a document, a user scanned the text looking for misspellings. Later, however, batch processing was offered in such packages as Oracle's short-lived CoAuthor and allowed a user to view the results after a document was processed and correct only the words that were known to be wrong. When memory and processing power became abundant, spell checking was performed in the background in an interactive way, such as has been the case with the Sector Software produced Spellbound program released in 1987 and Microsoft Word since Word 95.In recent years, spell checkers have become increasingly sophisticated; some are now capable of recognizing simple grammatical errors. However, even at their best, they rarely catch all the errors in a text (such as homophone errors) and will flag neologisms and foreign words as misspellings. Nonetheless, spell checkers can be considered as a type of foreign language writing aid that non-native language learners can rely on to detect and correct their misspellings in the target language.Banks, T. (2008). Foreign Language Learning Difficulties and Teaching Strategies. (pp. 29). Master's Thesis, Dominican University of California. Retrieved 19 March 2012.

Spell-checking non-English languages

English is unusual in that most words used in formal writing have a single spelling that can be found in a typical dictionary, with the exception of some jargon and modified words. In many languages, words are often concatenated into new combinations of words. In German, compound nouns are frequently coined from other existing nouns. Some scripts do not clearly separate one word from another, requiring word-splitting algorithms. Each of these presents unique challenges to non-English language spell checkers.

Context-sensitive spell checkers

Recently,{{when|date=March 2019}} research has focused on developing algorithms that are capable of recognizing a misspelled word, even if the word itself is in the vocabulary, based on the context of the surrounding words. Not only does this allow words such as those in the poem above to be caught, but it mitigates the detrimental effect of enlarging dictionaries, allowing more words to be recognized. For example, baht in the same paragraph as Thai or Thailand would not be recognized as a misspelling of bath. The most common example of errors caught by such a system are homophone errors, such as the bold words in the following sentence:
Their coming too sea if its reel.
The most successful algorithm to date is Andrew Golding and Dan Roth's "Winnow-based spelling correction algorithm",JOURNAL,weblink Journal Article, SpringerLink, 22 September 2010, published in 1999, which is able to recognize about 96% of context-sensitive spelling errors, in addition to ordinary non-word spelling errors. A context-sensitive spell checker appears in Microsoft Office 2007,NEWS,weblink Review, Walt Mossberg, 4 January 2007, Wall Street Journal, 24 September 2010, and also appeared in the now-defunct Google Wave.WEB,weblink Google Operating System, googlesystem.blogspot.com, 25 September 2010, WEB,weblink Google's Context-Sensitive Spell Checker, May 29, 2009, 25 September 2010, Grammar checkers attempt to fix problems with grammar beyond spelling errors, including incorrect choice of words.

See also

{{commons category|Spell checking}}

References

External links

  • {{dmoz|/Arts/Writers_Resources/Software/Spelling_and_Grammar/Spell_Checkers|List of spell checkers}}
  • Norvig.com, "How to Write a Spelling Corrector", by Peter Norvig
  • BBK.ac.uk, "Spellchecking by computer", by Roger Mitton
  • CBSNews.com, Spell-Check Crutch Curtails Correctness, by Lloyd de Vries
  • NIU.edu, Candidate for a Pullet Surprise - Complete corrected poem
{{Natural Language Processing}}

- content above as imported from Wikipedia
- "spell checker" does not exist on GetWiki (yet)
- time: 9:06pm EDT - Tue, Aug 20 2019
[ this remote article is provided by Wikipedia ]
LATEST EDITS [ see all ]
GETWIKI 09 JUL 2019
Eastern Philosophy
History of Philosophy
GETWIKI 09 MAY 2016
GETWIKI 18 OCT 2015
M.R.M. Parrott
Biographies
GETWIKI 20 AUG 2014
GETWIKI 19 AUG 2014
CONNECT