Difference between revisions of "Resources for English"

From ACL Wiki
Jump to navigation Jump to search
m (fix link)
 
(64 intermediate revisions by 7 users not shown)
Line 1: Line 1:
Most of the early additions have been moved here from the [http://www.aclweb.org/universe ACL NLP/CL Universe].
+
For other languages, see [[List of resources by language]].
  
==BIBLIOGRAPHY==
+
See also [[Multilingual resources]].
  
* [[Bibliographies]]
+
<!-- Please keep this list in alphabetical order -->
 +
* [[Corpora for English|Corpora]]
 +
* [[Dictionaries (English)|Dictionaries]]
 +
* [[Generation grammars]]
 +
* [[Geographical words (English)|Geographical words]]
 +
* [[Knowledge collections and datasets (English)|Knowledge collections and datasets]]
 +
* [[Lexicons (English)|Lexicons]]
 +
* [[Subject specific resources (English)|Subject specific resources]]
 +
* [[Tools and Software for English|Tools and Software]]
 +
* [[Uncategorized resources]] - ''please help in categorizing''
 +
 
 +
==Other resource lists==
 +
* [[Lists of resources|Other lists of resources]]
  
==BOOKS==
+
==Additional information==
 +
<!-- Please keep this list in alphabetical order -->
  
 +
* [[Anthology Statistics]]
 +
* [[Bibliographies]]
 +
* [[Blogs]]
 
* [[Books]]
 
* [[Books]]
 +
* [[Conferences]]
 +
* [[Courses]]
 +
* [[Journals]]
 +
* [[Newsgroups, mailing lists|Newsgroups and mailing lists]]
 +
* [[Papers]]
  
==COMPREHENSIVE==
+
[[Category:Resources by language|English]]
*[http://www.lboro.ac.uk/research/mmethods/research/software/caqdas_comparison.html CAQDAS Comparison]  -
 
 
 
*[http://www.sil.org/computing/comp-morph-phon.html Computational Morphology and Phonology (at Summer Institute for Linguistics)]
 
 
 
*[http://www.clres.com/lexdata.html Lexicon Acquisition, Development, and Analysis]
 
 
 
*[http://www.ldc.upenn.edu/ Linguistic Data Consortium, University of Pennsylvania]
 
 
 
*[http://www.music.princeton.edu/chant_html/ling.html Linguistics resources list at Princeton University]
 
 
 
*[http://www.phil.uni-passau.de/linguistik/linguistik_urls/ Links to Linguistic and Related Information (University of Passau)]
 
 
 
*[http://www.ims.uni-stuttgart.de/info/FTPServer.html List of resources (at University of Stuttgart)]
 
 
 
*[http://library.utoronto.ca/www/cch_subject_linguistics.html List of resources (at University of Toronto)]
 
 
 
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/corpora/0.html Natural Language Corpora and Dictionaries (at CMU)]
 
 
 
*[http://www.cs.umu.se/~mjm/step/nlis.html Natural Language Interfaces (NLIs) on the Web]
 
 
 
*[http://www.ai.mit.edu/projects/iiip/nlp.html NLP page at MIT]
 
 
 
*[http://www-nlp.stanford.edu/links/statnlp.html Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources]
 
 
 
*[http://www.willamette.edu/~tjones/Language-Page.html The human languages page]
 
 
 
*[http://www.ics.uci.edu/~mlearn/MLRepository.html UCI Machine Learning Repository]
 
 
 
*[http://www.coli.uni-sb.de/~hansu/clwebinfo.html WWW Information on Computational Linguistics and Language Technology]
 
 
 
*[http://www.yahoo.com/Science/Computer_Science/Artificial_Intelligence/Natural_Language_Processing/ Yahoo! index on Natural Language Processing]
 
 
 
==CORPORA==
 
*[ftp://ftp.cs.cornell.edu/pub/smart/time/ 1963 Time Magazine corpus]
 
 
 
*[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html 2000 NIST Speaker Recognition Evaluation Corpus]
 
 
 
*[http://www.coli.uni-sb.de/sfb378/negra-corpus/ A Syntactically Annotated Corpus of German Newspaper Texts]
 
 
 
*[http://ixa.si.ehu.es/Ixa/resources/sensecorpus A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)]
 
 
 
*[http://odur.let.rug.nl/~vannoord/trees/ Alpino Treebank]
 
 
 
*[http://www.cornelsen.de/international/ An Empirical Grammar of the English Verb System]
 
 
 
*[http://www.sultry.arts.usyd.edu.au/links/statnlp.html Annotated list of resources on statistical NLP and corpus-based CL]
 
 
 
*[http://www.aot.ru/search1.html AOT]
 
 
 
*[http://www.ldc.upenn.edu/Catalog/LDC2001T55.html Arabic Newswire Part 1]
 
 
 
*[http://atilf.atilf.fr/dmf.htm Base Textuelle de Moyen Francais]
 
 
 
*[http://thetis.bl.uk/ BNC Online Service]
 
 
 
*[http://bokrcorpora.narod.ru Bokr Russian Reference Corpus]
 
 
 
*[http://info.ox.ac.uk/bnc/ BRITISH NATIONAL CORPUS - WORLD EDITION]
 
 
 
*[http://www.dcs.gla.ac.uk/idom/ir_resources/ Collections of texts and corpora]
 
 
 
*[http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular]
 
 
 
*[http://www.corpusdelespanol.org/ Corpus del Espanol]
 
 
 
*[http://www.hf.uio.no/easteur-orient/bulg/mat/ Corpus of spoken Bulgarian]
 
 
 
*[http://pioneer.chula.ac.th/~awirote/ling/corpuslst.htm Corpus Resources (Chulalongkorn University, Thailand)]
 
 
 
*[ftp://ftp.cs.cornell.edu/pub/smart/cran/ Cranfield collection]
 
 
 
*[http://corpus.rae.es/creanet.html CREA]
 
 
 
*[http://ucnk.ff.cuni.cz/english/index.html Czech National Corpus]
 
 
 
*[http://korpus.dsl.dk/korpus2000/indgang.php Danish news corpus]
 
 
 
*[http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)]
 
 
 
*[http://www.hum.uva.nl/~ewn EuroWordNet]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/tc/CQP.html Experimental Corpus Query System (University of Stuttgart, Germany)]
 
 
 
*[http://www.csc.fi/kielipankki/ Finnish text bank]
 
 
 
*[http://hometown.aol.com/mit2haiti/Index4.html HAITIAN CREOLE ELECTRONIC TEXTS]
 
 
 
*[http://rali.iro.umontreal.ca/ Hansards Corpus - Searchable]
 
 
 
*[http://www.hcrc.ed.ac.uk/maptask/ HCRC Map Task Corpus XML annotations]
 
 
 
*[http://www.csc.fi/kielipankki/aineistot/hcs/index.phtml.en Helsinki Corpus of Swahili (HCS)]
 
 
 
*[http://nats-www.informatik.uni-hamburg.de/~ingo/icopost/ ICOPOST]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/TC.html IMS Corpus Toolbox, Univ. of Stuttgart]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/ IMS Corpus Workbench (CWB)]
 
 
 
*[http://cecl.fltr.ucl.ac.be/Cecl-Projects/Icle/icle.htm International Corpus of Learner English]
 
 
 
*[http://korpus.pl/en/ IPI PAN Polish Corpus]
 
 
 
*[http://www.ipds.uni-kiel.de/links/datenmaterial.en.html Kiel University's Institute on Phonetics and Speech Procesing]
 
 
 
*[http://www.nilc.icmc.usp.br/lacioweb Lacio Web Corpora]
 
 
 
*[http://www.vuw.ac.nz/llc/ LANGUAGE LEARNING CENTER - ACADEMIC CORPUS]
 
 
 
*[http://www.csse.monash.edu.au/~jwb/afaq/jitadoushi.html list of Japanese transitive - intransitive verb pairs]
 
 
 
*[http://www.dcs.gla.ac.uk/idom/ir_resources/linguistic_utils/stop_words List of stop words]
 
 
 
*[http://www.bmanuel.org/clr2_mp.html Manuel Barbera: General Corpora and Corpus Linguistics Resources]
 
 
 
*[ftp://ftp.cs.cornell.edu/pub/smart/med/ Medlars collection]
 
 
 
*[ftp://ftp.ox.ac.uk/pub/wordlists/ Miscellaneous Word Lists from Oxford University]
 
 
 
*[http://www.lpl.univ-aix.fr/projects/multext/ Multilingual Text Tools and Corpora]
 
 
 
*[http://www.census.gov/genealogy/names Name lists from US census]
 
 
 
*[http://www.di.fc.ul.pt/~ahb/nexing.htm Nexing Corpus]
 
 
 
*[http://www.cs.cmu.edu/web/books.html On-line books at CMU]
 
 
 
*[http://logos.uio.no/opus/ OPUS -- An Open Source Parallel Corpus]
 
 
 
*[http://www.uni-duisburg.de/Fak2/FremdPhil/Romanistik/Personal/Burr/humcomp/ Oxford Text Archive Corpus of Italian Newspapers]
 
 
 
*[http://elex.amu.edu.pl/~przemka/PICLE_search.php Polish subcorpus of the International Corpus of Learner English]
 
 
 
*[http://www.cirp.es/WXN/wxn/frames/proxectos.html Ramon Piero Center for Research]
 
 
 
*[http://about.reuters.com/researchandstandards/corpus/ Reuters Corpus]
 
 
 
*[http://www.cs.unt.edu/~rada/downloads.html Romanian NLP]
 
 
 
*[http://www.sfb441.uni-tuebingen.de/b1/korpora.html Russian Corpora]
 
 
 
*[http://rykov-cl.narod.ru/r.html Russian Corpora]
 
 
 
*[http://www.ruscorpora.ru/ Russian Corpus Page]
 
 
 
*[http://lib.ru/ Russian Corpus Site]
 
 
 
*[http://www.ng.ru Russian Corpus Site]
 
 
 
*[http://www.philol.msu.ru/~lex/corpus/ Russian Newspaper Corpus]
 
 
 
*[http://schools.keldysh.ru/uvk1838/Sciper/volume2/langres/russiclr.htm Russicon Resources]
 
 
 
*[http://sanskritlibrary.org/ Sanskrit Library]
 
 
 
*[http://korpus.juls.savba.sk/index.en.html Slovak National Corpus]
 
 
 
*[http://nl.ijs.si/elan/#corpus Slovene-English Parallel Corpus]
 
 
 
*[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio]
 
 
 
*[http://www.ldc.upenn.edu/Catalog/LDC2001S99.html Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio]
 
 
 
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/doc/notes/corpora.txt Survey of Electronic Corpora (by Jane A. Edwards, file at CMU)]
 
 
 
*[http://www.ucl.ac.uk/english-usage/ Survey of English Usage, University College, London]
 
 
 
*[http://www.icsi.berkeley.edu/real/stp/index.html Switchboard Transcription Project]
 
 
 
*[http://www.tractor.de/ TELRI Research Archive of Computational Tools and Resources]
 
 
 
*[http://childes.psy.cmu.edu/ The Childes Corpus - Children's language]
 
 
 
*[http://nora.hd.uib.no/index-e.html The CORPORA DataCenter (Norway)]
 
 
 
*[ftp://ftp.dcs.shef.ac.uk/share/ilash/Moby/ The Moby Corpus]
 
 
 
*[http://www.tekstlab.uio.no/Bosnian/Corpus.html The Oslo Corpus of Bosnian Texts]
 
 
 
*[http://www.sketchengine.co.uk/ The Sketch Engine]
 
 
 
*[http://www.hf.uio.no/tekstlab/prosjekter/SOFIE.htm The Sofie Treebank - A Parallel Treebank of North European Languages]
 
 
 
*[http://www.cis.upenn.edu/~treebank/tokenization.html Treebank tokenization scheme]
 
===CORPORA - ENGLISH===
 
*[http://www.elda.fr/catalogue/en/speech/S0115.html American English SpeechDat-Car]
 
 
 
*[http://americannationalcorpus.org/FirstRelease/ AMERICAN NATIONAL CORPUS FIRST RELEASE]
 
 
 
*[http://homepage.mac.com/bncweb/ BNCweb a web-based interface to the British National Corpus]
 
 
 
*[http://devoted.to/corpora Bookmarks for Corpus-based Linguists]
 
 
 
*[http://info.ox.ac.uk/bnc/ British National Corpus (from Oxford University)]
 
 
 
*[http://www.comp.lancs.ac.uk/computing/research/ucrel/bnc.html British National Corpus project page (from UCREL)]
 
 
 
*[http://www.athel.com/corpdes.html Corpus of Spoken Professional English]
 
 
 
*[http://www-rcf.usc.edu/~billmann/diversity/DDivers-site.htm Dialogue Diversity Corpus]
 
 
 
*[http://etext.lib.virginia.edu/ Electronic Text Center -- University of Virginia]
 
 
 
*[http://www.phon.ox.ac.uk/~esther/ivyweb/ English Intonation in the British Isles -The IViE Corpus]
 
 
 
*[http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/naive-bayes/bow-0.8/stopwords.c English stop words (from SMART)]
 
 
 
*[http://www-personal.umich.edu/~jlawler/levin.html English Verb Classes And Alternations: A Preliminary Investigation (Index)]
 
 
 
*[http://usna.edu/LangStudy/BNC/ Exploring Words and Phrases from the British National Corpus]
 
 
 
*[http://nora.hd.uib.no/icame.html ICAME]
 
 
 
*[http://www-2.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/naive-bayes/bow-0.8/stopwords.c List of English stopwords]
 
 
 
*[http://www.lsi.upc.es/~nlp/tools/mapping.html Mapping WordNet Versions 1.6 and 2.0]
 
 
 
*[http://www.cs.cornell.edu/People/pabo/movie-review-data/ Movie Review Data]
 
 
 
*[http://mwe.stanford.edu/resources/ Multiword Expression Resources]
 
 
 
*[http://pie.usna.edu/ Phrases in English]
 
 
 
*[http://homepages.feis.herts.ac.uk/~comrcml/Lyon-thesis.ps Restricted English Corpus from Dr. Caroline Lyon for PhD]
 
 
 
*[http://www.sketchengine.co.uk/ Sketch Engine]
 
 
 
*[http://www-2.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/corpora/susanne/0.html Susanne: Annotated American English Corpus]
 
 
 
*[http://clix.to/davidlee00 The BNC Index (for the BNCWorld Edition)]
 
 
 
*[http://www-users.york.ac.uk/~sp20/corpus.html The Brooklyn-Geneva-Amsterdam-Helsinki Parsed Corpus of Old English]
 
 
 
*[http://www-rcf.usc.edu/~billmann/diversity/DDivers-site.htm The Dialogue Diversity Corpus]
 
 
 
*[http://www.grsampson.net/LucyDoc.html The LUCY Corpus - Documentation]
 
 
 
*[http://www.cs.rochester.edu/research/cisd/resources/trains.html TRAINS Dialogue Corpus]
 
 
 
===CORPORA - GERMAN===
 
*[http://www.phonetik.uni-muenchen.de/Bas/BasKorporaeng.html Bavarian Archive for Speech Signals Corpora]
 
 
 
*[http://corpora.ids-mannheim.de/~cosmas/ COSMAS II]
 
 
 
*[http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus]
 
 
 
===CORPORA - MULTILINGUAL===
 
*[http://wt.jrc.it/lt/Acquis/ ACQUIS COMMUNAUTAIRE Multilingual Corpus]
 
 
 
*[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
 
 
 
*[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS]
 
 
 
*[http://www.linguateca.pt/COMPARA/ COMPARA corpus]
 
 
 
*[http://www.debian.org/international/ Debian free software community]
 
 
 
*[http://www.ling.lancs.ac.uk/corplang/emille EMILLE corpus]
 
 
 
*[http://people.csail.mit.edu/people/koehn/publications/europarl/ European Parliament Proceedings Parallel Corpus 1996-2003]
 
 
 
*[http://www.illc.uva.nl/EuroWordNet EuroWordNet]
 
 
 
*[http://www.france.diplomatie.fr/label_france/index.html French Foreign Ministry's magazine]
 
 
 
*[http://glossa.fltr.ucl.ac.be/ GlossaNet]
 
 
 
*[http://hometown.aol.com/mit2haiti/JA-HC-kr.htm Haitian Creole corpus -Teknoloji pou lang kreyol]
 
 
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC95T20 Hansard French-English parallel corpus]
 
 
 
*[http://www.ucl.ac.uk/english-usage/ice/avail.htm ICE corpora]
 
 
 
*[http://www.tu-chemnitz.de/phil/InternetGrammar/ Learner Behaviour on the Internet]
 
 
 
*[http://muchmore.dfki.de/resources1.htm MuchMore Springer Bilingual Corpus]
 
 
 
*[http://nl.ijs.si/ME/ MULTEXT-East: Multilingual Corpora for Eastern and Central European Languages]
 
 
 
*[http://tcc.itc.it/people/forner/multilingualcorpora.html Multilingual Corpora: Available Resources]
 
 
 
*[http://multisemcor.itc.it MultiSemCor]
 
 
 
*[http://www.ims.uni-stuttgart.de/info/Newspapers.html Newspapers on the Internet]
 
 
 
*[http://logos.uio.no/opus/ OPUS - an open source parallel corpus]
 
 
 
*[http://langbank.engl.polyu.edu.hk/indexl.html PolyU Language Bank]
 
 
 
*[http://register.consilium.eu.int/ Public registry of the Council of the EU]
 
 
 
*[http://www.multilingual.com/allen51.htm The Bible as a Resource for Translation Software]
 
 
 
*[http://www.cogsci.ed.ac.uk/elsnet/eci.html The ECI Multilingual corpus]
 
 
 
*[http://www.unhchr.ch/udhr/index.htm UN declaration of human rights in multiple languages]
 
 
 
*[http://www-igm.univ-mlv.fr/~unitex/ UNITEX]
 
 
 
*[http://www.u-grenoble3.fr/kraif/liens.htm Useful links about parallel corpora, by Olivier Kraif]
 
 
 
*[http://wacky.sslmit.unibo.it/ WaCky Project]
 
 
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
 
 
 
*[http://www.wortschatz.uni-leipzig.de/html/wliste.html Wortlisten: spoken German, English, French, and Dutch]
 
 
 
==COURSES==
 
*[http://www.cs.cmu.edu/afs/cs/academic/class/11761-s97/WWW/index.html 11-761 Language and Statistics, course at CMU, Spring 1997]
 
 
 
*[http://www.linguistics.ucla.edu/nasslli04/index.html 3rd NASSLLI: North American Summer School in Logic, Language and Information]  
 
 
 
*[http://www.alta.asn.au/events/altss2004/ ALTSS 2004: Australasian Language Technology Summer School]
 
 
 
*[http://www.cs.brandeis.edu/~llc/cs114/ Computational Linguistics, James Pustejovsky, Brandeis University]
 
 
 
*[http://www.cstit.cl.cam.ac.uk/gateway/ Computer Speech, Text and Internet Technology]
 
 
 
*[http://www.ling.lancs.ac.uk/monkey/ihe/linguistics/contents.htm Course in Corpus Linguistics, Tony McEnery & Andrew Wilson]
 
 
 
*[http://courses.cs.cornell.edu/cs674/2000SP/ CS674: Natural Language Processing (Cornell U., Spring 2000)]
 
 
 
*[http://people.cohums.ohio-state.edu/chan9/c889-w01.htm Databases and Corpora for Chinese Linguistic Research]
 
 
 
*[http://www.macs.hw.ac.uk/esslli05/ ESSLLI 2005 - 17th European Summer School in Logic, Language and Information]
 
 
 
*[http://www.linguistik.uni-erlangen.de/~rrh/Slides.html Foundations of Computational Linguistics by Roland Hausser]
 
 
 
*[http://www.cs.cornell.edu/courses/CS630/2006sp/ Human Language Technology (Cornell CS630, Spring 2006)]
 
 
 
*[http://www.coli.uni-sb.de/conf/Verb-Workshop-05/ Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes]
 
 
 
*[http://www.ucl.ac.uk/internet-grammar/home.htm Internet Grammar of English]
 
 
 
*[http://www.ling.gu.se/~lager/kurser/ExploringOz/ Multi-Paradigm Programming in Oz for NLP]
 
 
 
*[http://www.cs.cornell.edu/home/llee/naacl/summer-school/04/ NAACL-Supported Two-Week Summer School in Human Language Technologies]
 
 
 
*[http://www.colorado.edu/ling/courses/CourseArchives/spring99/6300/lab11.html Perl: extending your pos-tagger using regular expressions, Dan Jurafsky]
 
 
 
*[http://omega.math.albany.edu:8008/JaynesBook.html Probability Theory: The Logic Of Science (by E. T. Jaynes, Washington University, Saint Louis)]
 
 
 
*[http://www.sfwa.org/members/elgin/ REAL WORLD LINGUISTICS 101]
 
 
 
*[http://www.cs.berkeley.edu/~russell/aima.html Russell and Norvig - AI]
 
 
 
*[http://fr46.uni-saarland.de/tdc2004/ Short intensive course: Texts, Discourse and Corpora: Corpora in Linguistics and Related Fields]
 
 
 
*[http://www.cs.colorado.edu/~martin/slp.html Speech and Language Processing, by Daniel Jurafsky and James Martin]
 
 
 
*[http://www.cs.cornell.edu/courses/cs775/2001sp/default.html Statistical Natural Language Processing: Models and Methods]
 
 
 
*[http://www-english.tamu.edu/linguistics/ Texas A&M University Linguistics Course Listings]
 
 
 
==DICTIONARY==
 
*[http://www.notam.uio.no/~hcholm/altlang/ Alternative dictionaries]
 
 
 
*[http://www.elra.info Bilingual Dictionary French Arabic]
 
 
 
*[http://dictionary.cambridge.org/researchers.htm Cambridge Learner Dictionary]
 
 
 
*[http://www.canoo.net/ Canoo.net - German Dictionaries and Grammars]
 
 
 
*[ftp://ftp.cs.cmu.edu/project/fgdata/dict/ CMU pronunciation dictionary]
 
 
 
*[ftp://cs.nyu.edu/pub/html/comlex.html/README.html Comlex Syntax (Syntactic Dictionary of English)]
 
 
 
*[http://ota.ahds.ac.uk/texts/2469.html CUVPlus -- Oxford Text Archive]
 
 
 
*[http://dix.osola.com/ Deutsch-Spanisch Wˆrterbuch (German-Spanish Dictionary)]
 
 
 
*[http://www.uni-frankfurt.de/~felix/eurodictautom.html ECHO - Eurodicautom (multilingual technical dictionary)]
 
 
 
*[http://www.ibs.ee/dict/index.html English to Estonian]
 
 
 
*[http://www.freedict.org/en/flags-dict-tgz.html FreeDict Downloads]
 
 
 
*[http://nlp.stanford.edu/kirrkirr/ Kirrkirr 4.0 Dictionary Program]
 
 
 
*[http://dict.leo.org/ LEO: Deutsche-Englisches Wˆrterbuch (German-English Dictionary)]
 
 
 
*[http://www.ims.uni-stuttgart.de/tcl/RESOURCES/German-Lexicon-en.html Lexical information for German]
 
 
 
*[http://www.sil.org/glossary/linguistic_glossary.html Linguistics Glossary (from SIL)]
 
 
 
*[http://www.lai.com/lai/glossaries.html List of on-line dictionaries (from lai.com)]
 
 
 
*[http://longdo.ex.nii.ac.jp/ Longdo Thai Dictionary]
 
 
 
*[http://www.laum.uni-hannover.de/iln/bibliotheken/woerterbuecher.html On-line dictionaries (from Univ. of Hannover)]
 
 
 
*[http://www.nada.kth.se/skolverket/sve-eng.html Swedish to English]
 
 
 
*[http://www.cse.unsw.edu.au/~billw/nlpdict.html The NLP Dictionary]
 
 
 
*[http://www.wordsmyth.net Wordsmyth Children's Dictionary]
 
 
 
*[http://www.yourdictionary.com/ YourDictionary]
 
 
 
==FTP==
 
*[ftp://parcftp.xerox.com/pub/ Xerox PARC FTP site.]
 
 
 
==JOURNAL==
 
*[http://journals.cambridge.org/phpAds/adclick.php?bannerid=198 Natural Language Engineering]
 
 
 
*[http://www.acm.org/tois/ ACM Transactions on Information Systems]
 
 
 
*[http://www.elsevier.nl/freeinfo/mathcompcog/505626/505626.htm Cognition, a journal from Elsevier Science]
 
 
 
*[http://mitpress.mit.edu/COLI Computational Linguistics]
 
 
 
*[http://www.ida.liu.se/ext/etai/ ETAI - Electronic Transactions on Artificial Intelligence]
 
 
 
*[http://www.comp.leeds.ac.uk/icame/ ICAME Journal]
 
 
 
*[http://www.ijece.org/ IJECE]
 
 
 
*[http://www.elsevier.nl/inca/publications/store/2/4/4/ Information Processing and Management]
 
 
 
*[http://www.wkap.nl/journalhome.htm/1386-4564 Information Retrieval (Journal)]
 
 
 
*[http://www.jair.org/ Journal of Artificial Intelligence Research (JAIR)]
 
 
 
*[http://www.wkap.nl/journalhome.htm/0925-9902 Journal of Intelligent Information Systems]
 
 
 
*[http://logcom.oxfordjournals.org/ Journal of Logic and Computation]
 
 
 
*[http://www.cup.org/journals/jnlscat/nle/nle.html Journal of Natural Language Engineering]
 
 
 
*[http://www.asis.org/Publications/JASIS/jasis.html Journal of the American Society for Information Science]
 
 
 
*[http://tigger.cs.uwm.edu/~syali/jnle-kr-nlp/ Knowledge Representation for Natural Language Processing in Implemented Systems]
 
 
 
*[http://www.cs.utexas.edu/users/ml/mlj-nll/ Machine Learning Journal Special Issue on Natural Language Learning]
 
 
 
*[http://www.wkap.nl/journalhome.htm/0922-6567 Machine Translation]
 
 
 
*[http://www.marges-linguistiques.com/ ONLINE LINGUISTICS JOURNAL]
 
 
 
*[http://www.essex.ac.uk/web-sls/ WEB-SLS: The European Student Journal of Language and Speech]
 
 
 
==LANGUAGE==
 
*[http://www1.cs.columbia.edu/~mdiab/software/ASVMTools_2.0.tar.gz Basic Arabic Processing Tools]
 
 
 
*[http://www.dunglish.nl/ Dunglish]
 
 
 
*[http://www.up.univ-mrs.fr/veronis/donnees/index.html French Stopword List]
 
 
 
*[http://www.ivrix.org.il/projects/spell-checker/ Hebrew Spellchecker]
 
 
 
*[http://www.up.univ-mrs.fr/tresoc/ Le TrÈsor de la Langue Langue d'Oc]
 
 
 
*[http://actarus.atilf.fr/morphalou/ Lexique Morphalou]
 
 
 
*[http://online.anu.edu.au/asianstudies/ahcen/proudfoot/MCP/ Malay Concordance Project]
 
 
 
*[http://earth-info.nga.mil/gns/html/cntry_files.html Names Files of Selected Countries]
 
 
 
*[http://orleans.lti.cs.cmu.edu/Reap/ REAP Project: Reader-Specific Lexical Practice for Improved Reading Comprehension]
 
 
 
*[http://people.cs.uchicago.edu/~dinoj/icsi97syl.disc.gz Syllable-Level Conversational English Transcriptions]
 
 
 
*[http://spraakbanken.gu.se/lb/ The Bank of Swedish - A Linguistic Reference Database of G&ouml;teborg University]
 
 
 
*[http://www.sil.org/mexico/pub/vimsa.htm The Mariano Silva y Aceves Series]
 
 
 
*[http://www.unine.ch/info/clef/ UniNE stopword list for Portuguese]
 
 
 
*[http://geonames.usgs.gov/domestic/download_data.htm United States Geographic Names]
 
 
 
*[http://www.valencianlanguage.com/ Valencianlanguage.com]
 
 
 
 
 
==MAILING==
 
*[http://ling.ohio-state.edu/HPSG/Majordomo.html HPSG Mailing List]
 
 
 
*[http://www.eamt.org/mt-list.html MT List]
 
 
 
*[https://mailman.rice.edu/pipermail/funknet/2002-September/002331.html Natural Semantic Metalanguage List]
 
 
 
*[http://www.sigir.org/sigirlist/issues/ SIG-IRList Archives]
 
 
 
*[http://www.hd.uib.no/ The CORPORA list]
 
 
 
==ONLINE==
 
*[http://www.ldc.upenn.edu/exploration/survey.html A Survey of Open Language Archives]
 
 
 
*[http://www.siggen.org/resources/ ACL SIGGEN Resources Wiki]
 
 
 
*[http://www.cs.kun.nl/agfl/ AFGL Parser Generator]
 
 
 
*[http://www.let.rug.nl/~vannoord/alp/ Algorithms for Linguistic Processing]
 
 
 
*[http://www.a-i.com/ Artificial Intelligence NV (Ai)]
 
 
 
*[http://www.eprints.org/ Author/Institution Self-Archiving]
 
 
 
*[http://www.chinesecomputing.com Chinese Computing]
 
 
 
*[http://www.copernic.com/ COPERNIC 2000]
 
 
 
*[http://www.linguateca.pt/corpografo CorpÛgrafo]
 
 
 
*[http://www.cis.upenn.edu/~dbikel/#stat-parser Dan Bikel's Parser]
 
 
 
*[http://java.sun.com/docs/books/tutorial/i18n/text/boundaryintro.html Detecting Text Boundaries]
 
 
 
*[http://www.catchword.com/era Educational Research Abstracts]
 
 
 
*[http://emotion-research.net/wiki/Databases Emotional Databases]
 
 
 
*[http://lingo.stanford.edu/erg.html English Resource Grammar]
 
 
 
*[http://www.cjk.org/cjk/samples/chincomc.htm English-Chinese Chinese-English Dictionary of Computer Terms]
 
 
 
*[http://www.freelangonline.com/ Freelangonline - many on-line dictionaries + more]
 
 
 
*[http://dmoz.org/Computers/Software/Information_Retrieval/ Information Retrieval]
 
 
 
*[http://ir.dcs.gla.ac.uk/resources.html IR resources]
 
 
 
*[http://odin.prohosting.com/hkkim/cgi-bin/kaeps/ Korean Accented English Pronunciation Simulator]
 
 
 
*[http://www.kwicfinder.com/KWiCFinder.html KwicFinder Web Concordancer and Online Research Tool]
 
 
 
*[http://www.academiaisla.com/acadi/gen/0_en.html LANGUAGE LINKS]
 
 
 
*[http://www.link.cs.cmu.edu/lexfn/ Lexical FreeNet]
 
 
 
*[http://www-lfg.stanford.edu/lfg/ilfga LFG Database: List of Names]
 
 
 
*[http://lse.umiacs.umd.edu/ Linguist's Search Engine]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/ Linguistic Interpretation of a German Corpus]
 
 
 
*[http://www.link.cs.cmu.edu/ LINK GRAMMAR PARSER]
 
 
 
*[http://www.eturner.net/linkgrammar-wn/ LinkGrammar-WN project]
 
 
 
*[http://www.psy.uwa.edu.au/MRCDataBase/uwa_mrc.htm MRC Psycholinguistic Database]
 
 
 
*[http://nl.ijs.si/ME/V3/ Multext East Resources, Version 3]
 
 
 
*[http://multiwordnet.itc.it/english/home.php MultiWordNet]
 
 
 
*[http://www.comp.nus.edu.sg/~rpnlpir/ Natural Language Processing / Information Retrieval Software Repository]
 
 
 
*[http://nlsh.sourceforge.net/ NLSH: Natural Language Shell]
 
 
 
*[http://www.irisa.fr/Omphalos/ Omphalos Context-Free Language Learning Competition]
 
 
 
*[http://ysomeya.hp.infoseek.co.jp/ Online Business Letter Corpus KWIC Concordancer]
 
 
 
*[http://www.grsampson.net/RLeafAnc.html Parse Evaluation]
 
 
 
*[http://pie.usna.edu Phrases in English and the British National Corpus]
 
 
 
*[http://pygoogle.sourceforge.net/ PyGoogle: A Python Interface to the Google API]
 
 
 
*[http://www.ai.uga.edu/mc/PythonForNewbieLinguists.html Python Programming Tutorial]
 
 
 
*[http://corpus.leeds.ac.uk/internet.html Query to Internet Corpora]
 
 
 
*[http://www.lexmasterclass.com/exercises/regex/index.html Regular Expression Exercises]
 
 
 
*[http://www.sims.berkeley.edu/~hjiang1/eng_chi_resources.html Resources for English-Chinese CLIR]
 
 
 
*[http://www.cs.technion.ac.il/~gabr/resources/resources.html Resources for Text, Speech and Language Processing]
 
 
 
*[http://www.sfu.ca/rst/ Rhetorical Structure Theory (RST)]
 
 
 
*[http://www.philol.msu.ru/rus/galya-1 Russian Phonetics on the Web]
 
 
 
*[http://www.clres.com/SensSemRoles.html Senseval-3 Task: Automatic Labeling of Semantic Roles]
 
 
 
*[http://www.clres.com/SensWNDisamb.html Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses]
 
 
 
*[http://www.cs.unt.edu/~rada/wa/ Sentence Alignment and Word Alignment: Projects, Papers, Evaluation, Etc.]
 
 
 
*[http://ontoweb-lt.dfki.de/knowledge_index.htm SIG5 OntoWeb]
 
 
 
*[http://sara.natcorp.ox.ac.uk/lookup.html Simple Search of BNC-World]
 
 
 
*[http://www.sigsem.org/ Special Interest Group on Computational Semantics]
 
 
 
*[http://www.grsampson.net/RSue.html SUSANNE Analytic Scheme]
 
 
 
*[http://www.telemakus.net/ Telemakus: Mining and Mapping Research Findings to Promote Knowledge Discovery]
 
 
 
*[http://www.clef-campaign.org/ The Cross-Language Evaluation Forum]
 
 
 
*[http://www.robotwisdom.com/web/biography.html The Internet Timelines Project]
 
 
 
*[http://www.fb10.uni-bremen.de/anglistik/langpro/NLG-table/NLG-table-root.htm The John Bateman and Michael Zock's list of Natural Language Generation Systems]
 
 
 
*[http://www.RosettaProject.org/ The Rosetta PrOject]
 
 
 
*[http://www.cis.upenn.edu/~xtag The XTAG Project]
 
 
 
*[http://www.TSrali.com/ TransSearch]
 
 
 
*[http://www-nlpir.nist.gov/projects/trecvid/ TREC Video Retrieval Evaluation Page]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html TreeTagger - a language independent part-of-speech tagger]
 
 
 
*[http://www.biomedcentral.com/info/about/datamining= Using BioMed Central's Open Access Full Text Corpus for Data Mining Research]
 
 
 
*[http://view.byu.edu/ VIEW: Variation in English Words and Phrases]
 
 
 
*[http://beta.visl.sdu.dk VISL Tagger and Parser]
 
 
 
*[http://www.webir.org/ Web IR & IE]
 
 
 
*[http://elib.cs.berkeley.edu/docfreq/ Web Term Document Frequency Form (Berkeley)]
 
 
 
*[http://www.niederlandistik.fu-berlin.de/cgi-bin/web-conc.cgi WEB-CONC]
 
 
 
*[http://www.webcorp.org.uk/ WEBCORP]
 
 
 
*[http://www.webexp.info/ WebExp2 Experimental Software]
 
 
 
*[http://www.comp.lancs.ac.uk/ucrel/bncfreq/ Word Frequencies in Written and Spoken English (Based on the British National Corpus)]
 
 
 
*[http://tcc.itc.it/research/textec/topics/disambiguation/wordnetdomains.html WordNet Domains]
 
 
 
*[http://www.ldc.upenn.edu/exploration/expl2000/papers/ Workshop on Web-Based Language Documentation and Description Papers]
 
 
 
*[http://www.gmi.org/wlms/ World Language Mapping System]
 
 
 
==PAPERS==
 
*[http://www.comp.leeds.ac.uk/eric/iwcs.ps A Domain-Independent Semantic Tagger for the Study of Meaning Associations in English Text]
 
 
 
*[http://www3.interscience.wiley.com/cgi-bin/abstract/104525215/ABSTRACT Automatic Construction of English/Chinese Parallel Corpora]
 
 
 
*[ftp://ftp.icsi.berkeley.edu/pub/techreports/ Berkeley - Technical Reports]
 
 
 
*[http://sslmit.unibo.it/~baroni/publications/lrec2004/bootcat_lrec_2004.pdf BootCaT: Bootstrapping Corpora and Terms from the Web]
 
 
 
*[http://acl.ldc.upenn.edu/I/I05/I05-2015.pdf Building an Annotated Japanese-Chinese Parallel Corpus]
 
 
 
*[http://www.ucl.ac.uk/english-usage/diachronic/index.htm DCPSE: Creating a Parsed and Searchable Diachronic Corpus of  Present-Day Spoken English]
 
 
 
*[http://www.ims.uni-stuttgart.de/info/EPapers.html Electronically available papers (list at Univ. of Stuttgart)]
 
 
 
*[http://www.cis.upenn.edu/~cliff-group/94/reports.html Linc Lab (U. Penn) technical reports (not on-line)]
 
 
 
*[http://www.let.rug.nl/~tanja/ Linguistic Knowledge and Word Sense Disambiguation]
 
 
 
*[http://ir.shef.ac.uk/cloughie/papers.html Measuring Text Reuse]
 
 
 
*[http://www.linguistics.rub.de/~kiss/publications/publications.html#boundaries Paper on Sentence Boundary Disambiguation]
 
 
 
*[http://www.dfki.de/lt/papers/cl-abstracts.html Papers of the DFKI CL Department]
 
 
 
*[http://www1.cs.columbia.edu/nlp/theses.html PhD Theses (Columbia Natural Processing Language Group)]
 
 
 
*[http://www.itri.brighton.ac.uk/ucnlg/Proceedings/index.html Proceedings of the Corpus Linguistics 2005 Workshop on Using Corpora for Natural Language Generation]
 
 
 
==SOFTWARE==
 
*[http://www.answerbus.com/index.shtml Answerbus -- Automatic Language Detection Software]
 
 
 
*[http://homepage.mac.com/bncweb/home.html BNCweb: A Web-Based Interface to the British National Corpus]
 
 
 
*[http://nlg18.csie.ntu.edu.tw:8080/opinion/index.html Chinese sentiment dictionary NTUSD]
 
 
 
*[http://www.athel.com/colloc.html Collocate]
 
 
 
*[http://lingo.stanford.edu/ CSLI LinGO Lab (Stanford)]
 
 
 
*[http://www.lsi.upc.es/~nlp/freeling/ FreeLing 1.1]
 
 
 
*[http://www.webir.org/resources.html IR and IE on the web]
 
 
 
*[http://www.comp.nus.edu.sg/~qiul/NLPTools/JavaRAP.html JavaRAP]
 
 
 
*[https://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library)]
 
 
 
*[http://xlex.uni-muenster.de/ MTP Xlex/www]
 
 
 
*[http://www.langsoft.ch Natural Language Processing software]
 
 
 
*[http://www.dfki.de/lt/registry/draft.html Natural Language Software Registry (at DFKI)]
 
 
 
*[http://www.nzdl.org/ELKB/ Roget's Thesaurus as an Electronic Lexical Knowledge Base]
 
 
 
*[http://www.chass.utoronto.ca/tact/ Text Analysis Computing Tools (TACT)]
 
 
 
*[http://odur.let.rug.nl/~vannoord/TextCat/ TextCat]
 
 
 
*[http://igm.univ-mlv.fr/~unitex/ Unitex]
 
 
 
*[http://www.mith2.umd.edu/products/ver-mach/ Versioning Machine 2.0]
 
===SOFTWARE - APPLICATIONS===
 
*[http://www.d.umn.edu/~tpederse/code.html Bigram Statistics Package]
 
 
 
*[http://webdeptos.uma.es/filifa/personal/amoreno/indexer/ BNC Indexer]
 
 
 
*[http://www.brainhat.com/ Brainhat Natural Language Processing]
 
 
 
*[http://www.chilibot.net/ Chilibot: NLP based miner for gene/protein/keyword relationships]
 
 
 
*[http://www.bultreebank.org/clark CLaRK System]
 
 
 
*[http://delphesintl.com/ Delphes Technologies International]
 
 
 
*[http://www.dtreg.com DTREG 2.0 decision trees with TreeBoost]
 
 
 
*[http://www.dfki.de/lt/registry/apps/korek21.html KOREKTOR 2.0 (at the DFKI NLP archive)]
 
 
 
*[http://www.xs4all.nl/~bsarempt/linguistics/index.html KURA 1.0]
 
 
 
*[http://www.ccs.neu.edu/home/futrelle/bionlp/commercial/opus.html Opus, a commercial biology text mining system]
 
 
 
*[http://sourceforge.net/projects/pytalk/ Project: Pytalk]
 
 
 
*[http://www.wagsoft.com/RSTTool/ Release of RSTTool: RSTTool 2.7]
 
 
 
*[http://www.softissimo.com/ SOFTISSIMO]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html TreeTagger]
 
 
 
 
 
 
 
===SOFTWARE - FONTS===
 
*[http://attavik.ca/en_index.html Attavik.net]
 
 
 
 
 
===SOFTWARE - GENERATION===
 
*[http://kwicfinder.com/kfNgram/kfNgramHelp.html kfNgram]
 
 
 
*[http://www.stir.ac.uk/english/communication/Computational-tools KPML]
 
 
 
*[http://www.isi.edu/licensed-sw/halogen/index.html The HALogen Natural Language Generation system]
 
 
 
===SOFTWARE - KNREP===
 
*[http://www.linguateca.pt/corpografo/ Corpografo 2]
 
 
 
*[http://www.conceptnet.org The ConceptNet Project V2.1]
 
 
 
===SOFTWARE - MISC===
 
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/allen/0.html Code from James Allen's "Natural Language Understanding" (code at CMU)]
 
 
 
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/nlp_pp/0.html Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)]
 
 
 
*[http://www.dtreg.com/ DTREG decision tree generator]
 
 
 
*[http://www.sics.se/ps/sicstus.html SICStus - a Prolog environment]
 
 
 
===SOFTWARE - MORPHOLOGY===
 
 
 
*[http://www.cs.jhu.edu/~brill/RBT1_14.tar.Z Brill Tagger (Supervised, Trainable)]
 
 
 
*[http://delphesintl.com/ Delphes Technologies International, natural language processing.]
 
 
 
*[http://www.univ-nancy2.fr/pers/namer/Telecharger_Flemm.html FLEMMV31 - Inflectional morphology parser for French]
 
 
 
*[http://services.canoo.com/MorphologyBrowser.html German Morphology Browser]
 
 
 
*[http://www.cs.technion.ac.il/~erelsgl/bxi/hmntx/teud.html Hebrew Morphological Parser]
 
 
 
*[http://nlp.postech.ac.kr/DownLoad/k_api.html Korean morphological analyzer and part-of-speech tagger]
 
 
 
*[http://bach.arts.kuleuven.ac.be/~piet/morlex/index.html MORLEX - A lexical database for French]
 
 
 
*[http://sslmitdev-online.sslmit.unibo.it/linguistics/morph-it.php Morph-It! version 0.31]
 
 
 
*[http://www.cis.hut.fi/morphochallenge2005/ MorphoChallenge]
 
 
 
*[http://www.glue.umd.edu/~kareem/research/ Sebawai: An Arabic Morphological Analyzer]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/SFST.html Stuttgart Finite State Transducer Tools]
 
 
 
===SOFTWARE - MT===
 
*[http://www.travlang.com/Ergane/ Ergane]
 
 
 
*[http://www.lsi.upc.edu/%7Enlp/IQMT/ IQMT Framework for MT Evaluation]
 
 
 
*[http://www.isi.edu/natural-language/people/germann/software/ReWrite-Decoder/index.html ISI rewrite decoder]
 
 
 
*[http://www.scitechint.com/slp Resource for professional-quality language translation tools.]
 
 
 
===SOFTWARE - MULTI===
 
*[http://bach.arts.kuleuven.ac.be/~piet/fr_nlp.html Natural Language Processing for French]
 
 
 
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/ TIGERSearch - tools for linguistic text exploration]
 
 
 
===SOFTWARE - MULTILINGUAL===
 
*[http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html#Spanish-dicts Dictionaries for International Ispell]
 
 
 
*[http://let.dfki.uni-sb.de/~heinz/liste.php3 list of systems in multiple languages]
 
 
 
*[http://www.lpl.univ-aix.fr/projects/multext/MtRecode/ MtRecode - Character conversion program]
 
 
 
*[http://www.lpl.univ-aix.fr/projects/multext/MtScript/ MtScript - The Multext multi-lingual text editor]
 
 
 
*[http://www.knowledge.co.uk/xxx/ Multilingual PC software]
 
 
 
*[http://www.scitechint.com/slp Resource for high-quality tools supporting multi-lingual communication]
 
 
 
===SOFTWARE - PHONOLOGY===
 
*[http://ling.ucsd.edu/~barker/Syllables/index.txt How Many Syllables Does English Have?]
 
 
 
*[http://mypage.siu.edu/lhartman/ Phono- Sound Change Model Software]
 
===SOFTWARE - SEMANTICS===
 
*[http://www.intellexer.com/ Intellexer - Natural Language Searching Technologies]
 
 
 
*[http://www.clres.com/prepositions.html Preposition Project]
 
 
 
*[http://senseclusters.sourceforge.net/ SenseClusters]
 
 
 
*[http://www.comp.lancs.ac.uk/ucrel/usas/ UCREL Semantic Analysis System]
 
 
 
===SOFTWARE - SPEECH===
 
 
 
*[http://www.speech.cs.cmu.edu/sphinx/ CMU Sphinx Group: Open Source Speech Recognition Engines]
 
 
 
*[http://www.cs.ubc.ca/~murphyk/Software/HMM/hmm.html Hidden Markov Model (HMM) Toolbox for Matlab]
 
 
 
*[http://nextens.uvt.nl/ NeXTeNS - Dutch Extension for Text to Speech]
 
 
 
*[http://bach.arts.kuleuven.ac.be/pmertens/prosogram/ Prosogram]
 
 
 
*[http://ilk.uvt.nl/g2p-www-demo.html TreeTalk: Memory - Based Grapheme - Phoneme Conversion Demo]
 
 
 
*[http://download.com.com/3000-2130-10277576.html?tag=lst-0-1 Verbot preview 4.0]
 
 
 
===SOFTWARE - SYNTAX===
 
*[http://www.comp.leeds.ac.uk/amalgam/amalgam/amalghome.htm AMALGAM project]
 
 
 
*[http://lael.pucsp.br/corpora/segmentador/ CEPRIL -  Portugese Segmenter]
 
 
 
*[http://www.collectivelanguage.com/demo.html Collective (Chaotic - Emergent) Language]
 
 
 
*[http://search.cpan.org/dist/Lingua-EN-Sentence/ CPAN Lingua EN Sentence Splitter]
 
 
 
*[http://search.cpan.org/dist/Lingua-HE-Sentence/ CPAN Lingua HE Sentence Splitter]
 
 
 
*[http://search.cpan.org/~tgrose/HTML-Summary-0.017/ CSPAN Sentence Splitter]
 
 
 
*[http://gate.ac.uk GATE, A General Architecture for Text Engineering]
 
 
 
*[http://www.infogistics.com/posdemo.htm Infogistic - NLProcessor Interactive Demo]
 
 
 
*[http://www.andy-roberts.net/software/jTokeniser jTokeniser]
 
 
 
*[http://openccg.sourceforge.net/ OpenCCG]
 
 
 
*[http://www.coli.uni-sb.de/~thorsten/tnt/ Saarland University, Computational Linguistics]
 
 
 
*[http://www.ling.helsinki.fi/~tapanain/dg/ Syntactic dependency parser for English]
 
 
 
*[http://www.ece.ubc.ca/~donaldd/treeform.htm TreeForm Syntax Tree Drawing Software]
 
 
 
*[http://bach.arts.kuleuven.ac.be/~piet/vertex/index.html VERTEX - A chart parser for unification grammars (French)]
 
 
 
*[http://visl.hum.sdu.dk/visl/ VISL - Visual Interactive Syntax Learning]
 
 
 
===SOFTWARE - TOOLS===
 
*[http://www.ldc.upenn.edu/Projects/ACE/Tools/  Automatic Content Extraction (ACE): Annotation Tools]
 
 
 
*[http://www.norvig.com/paip/grammar.lisp a simple grammar of English]
 
 
 
*[http://sourceforge.net/projects/acopost/ ACOPOST]
 
 
 
*[http://acdc.linguateca.pt/example_alignment.html Alignment of bilingual corpora performed with EasyAlign]
 
 
 
*[http://www.lsi.upc.es/~lambert/software/AlignmentSet.html Alignment Set Toolkit]
 
 
 
*[http://www.clres.com/WordNet.html alphabetic version of WordNet 2.0]
 
 
 
*[http://lucene.apache.org/java/docs/ Apache Lucene]
 
 
 
*[http://www.arabeyes.org/ Arabeyes Project]
 
 
 
*[http://members.aol.com/gnhbos/ocr.htm Aramedia]
 
 
 
*[http://www.umiacs.umd.edu/~jimmylin/downloads/index.html Aranea Question Answering System]
 
 
 
*[http://www.ltg.ed.ac.uk/~jo/interarbora/ Arbora Tree Delivery Service]
 
 
 
*[http://misshoover.si.umich.edu/~zzheng/sentence/ Automatic English Sentence Segmentation]
 
 
 
*[http://www.clg.wlv.ac.uk/projects/CAST/demos.php Automatic Summarization Demos]
 
 
 
*[http://www.r.dl.itc.u-tokyo.ac.jp/~nakagawa/resource/termext/atr-e.html Automatic Term Extraction System]
 
 
 
*[http://lael.pucsp.br/corpora/ Bancos de dados e Ferramentas de an`alise]
 
 
 
*[http://www.ai.mit.edu/~murphyk/Software/BNT/bnt.html Bayes Net Toolbox for Matlab]
 
 
 
*[http://bndev.sourceforge.net/ Bayesian Network tools in Java (BNJ)]
 
 
 
*[http://sslmit.unibo.it/~baroni/bootcat.html BootCaT: Simple Utilities to Bootstrap Corpora and Terms from the Web]
 
 
 
*[http://callisto.mitre.org/ Callisto Annotation Tool]
 
 
 
*[http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2005T13 CCGBank]
 
 
 
*[http://lael.pucsp.br/corpora/alinhador/ CEPRIL aligner]
 
 
 
*[http://pie.usna.edu/explorec.html Chargrams Database from British National Corpus]
 
 
 
*[http://www.bultreebank.org/clark/index.html CLaRK System]
 
 
 
*[http://dlt4.mit.edu/~dr/COALS/ COALS: Correlated Occurrence Analogue to Lexical Semantics]
 
 
 
*[ftp://cs.nyu.edu/pub/html/comlex.html/ Comlex]
 
 
 
*[http://www.ai.mit.edu/projects/iiip/doc/cl-http/home-page.html Common Lisp Hypermedia Server]
 
 
 
*[http://www.cpan.org/ Comprehensive Perl Archive Network]
 
 
 
*[http://clg.wlv.ac.uk/projects/CAST/ Computer Aided Summarisation Tool (CAST)]
 
 
 
*[http://infomap.stanford.edu/webdemo Concept Search Engine Information Mapping Demo (Center for the Study of Language and Information, Stanford University)]
 
 
 
*[https://sourceforge.net/projects/concollate/ Concollate]
 
 
 
*[http://borel.slu.edu/crubadan/ Corpus building for minority languages]
 
 
 
*[http://montev.isi.edu:8000/align-tool/?CORPUS=de-news-morphix&AFILE=full-model1-50-50.gz Corpus De-News-Morphix Alignment Tool]
 
 
 
*[http://search.cpan.org/dist/SuffixTree/ CPAN Suffix Tree Module]
 
 
 
*[http://www.ucl.ac.uk/english-usage/diachronic/index.htm Creating a Parsed and Searchable Diachronic Corpus of Present-Day Spoken English]
 
 
 
*[http://www.cis.upenn.edu/~dbikel/software.html#wn Dan Bikel's Java WordNet Library]
 
 
 
*[http://www.dataharmony.com/ Data Harmony, Document Management Software]
 
 
 
*[http://www.cs.ualberta.ca/~lindek/demos.htm Demos of dependency database, parser, and other tools]
 
 
 
*[http://fuzzy.cs.uni-magdeburg.de/~borgelt/dtree.html Dtree - Decision and Regression Tree Induction]
 
 
 
*[http://www.foreignword.com/dictionary/truespel/transpel.htm English-Truespel (USA Accent) Text Conversion Tool]
 
 
 
*[http://www.cs.jhu.edu/~brill/ Eric Brill's Part of Speech Tagger]
 
 
 
*[http://odur.let.rug.nl/~vannoord/Fsa/Manual/node1.html Finite State Automata Utilities v6]
 
 
 
*[http://www.jaist.ac.jp/~hieuxuan/flexcrfs/flexcrfs.html FlexCRFs: Flexible Conditional Random Fields]
 
 
 
*[http://garraf.epsevg.upc.es/freeling/ FreeLing 1.2]
 
 
 
*[http://grid.let.rug.nl/~vannoord/Fsa/fsa.html FSA6.2xx: Finite State Automata Utilities]
 
 
 
*[http://gate.ac.uk/ GATE (General Architecture for Text Engineering)]
 
 
 
*[http://www.clsp.jhu.edu/ws2005/groups/statistical/GenPar.html GenPar Toolkit for Generalized Parsing]
 
 
 
*[http://www.parc.xerox.com/istl/groups/nltt/medley/ Grammar Writer's Workbench for Lexical Functional Grammar]
 
 
 
*[http://htk.eng.cam.ac.uk Hidden Markov Model Toolkit]
 
 
 
*[http://www.ida.liu.se/~nlplab/ILink/ I*Link]
 
 
 
*[http://www.kbsim.com/ifind.html iFind KBSim.com - Knowledge-Based Simulations, Inc.]
 
 
 
*[http://www.infogistics.com/posdemo.htm Infogistics: NLProcessor Interactive Demo]
 
 
 
*[http://www.isi.edu/~marcu/software.html ISI's version of the RSTTool]
 
 
 
*[http://www-2.cs.cmu.edu/~javabayes/Home/ JavaBayes - v0.346]
 
 
 
*[http://www.comp.leeds.ac.uk/andyr/software/jTokeniser/ jTokeniser]
 
 
 
*[http://sourceforge.net/projects/jwordnet/ JWNL (Java WordNet Library)]
 
 
 
*[http://sslmit.unibo.it/%7ebaroni/welcome_to_knorpora.html Knorpora 1.0]
 
 
 
*[http://miniappolis.com/KWiCFinder/KWiCFinderHome.html KWiCFinder]
 
 
 
*[http://www.kwicfinder.com/KWiCFinder.html Kwicfinder]
 
 
 
*[http://odur.let.rug.nl/~vannoord/TextCat/competitors.html Language Identification Tools]
 
 
 
*[http://www-2.cs.cmu.edu/~lemur/download.html Lemur Toolkit Download]
 
 
 
*[http://www.lemurproject.org/ Lemur Toolkit Website]
 
 
 
*[http://www.leximancer.com/ Leximancer]
 
 
 
*[http://www.csie.ntu.edu.tw/~cjlin/libsvm/ LIBSVM: A Library for Support Vector Machines]
 
 
 
*[http://www.alias-i.com/lingpipe/ LingPipe]
 
 
 
*[http://search.cpan.org/~lgoddard/Lingua-Syllable-0.03/Syllable.pm Lingua-Syllable]
 
 
 
*[http://listserv.linguistlist.org/cgi-bin/wa?A2=ind0109&L=corpora&P=R729 list of POS taggers]
 
 
 
*[http://ucrel.lancs.ac.uk/llwizard.html Log-likelihood calculator]
 
 
 
*[ftp://ftp.ncbi.nlm.nih.gov/pub/lsmith/MedPost/medpost.tar.gz MedPost: A Part-of-Speech Tagger for BioMedical text]
 
 
 
*[http://www.lexically.net/wordsmith/version4/index.htm Mike Scott's Web - Wordsmith Tools]
 
 
 
*[http://mmax.eml-research.de MMAX Annotation Tool]
 
 
 
*[http://www.dcs.shef.ac.uk/research/ilash/Moby/ Moby Database]
 
 
 
*[http://search.cpan.org/author/SHLOMOY/Lingua-EN-Sentence-0.25/lib/Lingua/EN/Sentence.pm Module for splitting text into sentences]
 
 
 
*[http://www.cs.berkeley.edu/~aiken/moss.html Moss: A System for Detecting Software Plagiarism]
 
 
 
*[http://www.clsp.jhu.edu/ws2005/groups/statistical/mtv.html Multitree Viewer (MTV)]
 
 
 
*[http://www.natlantech.com/lingbench_ide.html Natlanco]
 
 
 
*[http://www.cs.jhu.edu/~brill/code.html Natural Language Processing Systems]
 
 
 
*[http://www.ltg.ed.ac.uk/NITE/ NITE XML Toolkit]
 
 
 
*[http://nltk.sourceforge.net NLTK - Natural Language Toolkit]
 
 
 
*[http://crl.nmsu.edu/Tools/Software/ NMSU Natural Language Processing Tools]
 
 
 
*[http://annotation.semanticweb.org/ontomat/index.html Ontomat Homepage]
 
 
 
*[http://teach-computers.org/word-expert.html Open Mind]
 
 
 
*[http://davinci.cs.ucdavis.edu/ OpenRCT Home]
 
 
 
*[http://www.oriel.org/homonym.htm ORIEL -- Online Research Information Environment for the Life Sciences]
 
 
 
*[http://clg.wlv.ac.uk/projects/PALinkA/ PALinkA: A Resource Annotation Tool]
 
 
 
*[http://www.sil.org/ PC-KIMMO, Englex, PC-PATR, and PC-PARSE]
 
 
 
*[http://www.bluem.net/downloads/pdftotext_en/ PDF to Text]
 
 
 
*[http://wall.jussieu.fr/dyn/Context2 perl concordancer]
 
 
 
*[http://www.ai.mit.edu/~jrennie/WordNet/ Perl interface to WordNet]
 
 
 
*[http://www.tartarus.org/~martin/PorterStemmer/index.html Porter Stemming Algorithm]
 
 
 
*[http://www.sciences.univ-nantes.fr/info/perso/permanents/enguehard/recherche/CoRRecT/CoRRecT_gb.htm Project CoRRecT: Reference Corpus for the Recognition of Terms]
 
 
 
*[http://protege.stanford.edu/ Protege Project]
 
 
 
*[http://www.lingsoft.fi/cgi-pub/engcg Publically available POS tagger]
 
 
 
*[http://www.openchannelsoftware.org/projects/Qanda Qanda: Open source question answering system]
 
 
 
*[http://corpus.leeds.ac.uk/query-zh.html Query to Chinese Corpora]
 
 
 
*[http://www-rali.iro.umontreal.ca/Reacc/ R&eacute;acc - reaccenting software]
 
 
 
*[http://rdues.uce.ac.uk/acronym.shtml RDUES ACRONYM (Automatic Collocational Retrieval of NYMs) Project]
 
 
 
*[http://www.comp.nus.edu.sg/~rpnlpir/daemonCollins/ README for the daemonized version of Collins' Parser]
 
 
 
*[http://www.research-lab.com/ Research-lab.com]
 
 
 
*[http://hacks.oreilly.com/pub/h/1011 Robot Karaoke]
 
 
 
*[http://www.reitter-it-media.de/compling/index.html RST LaTeX  (Reitter IT and Media)]
 
 
 
*[http://herzberg.ca.sandia.gov/jess/index.shtml Rule Engine for the Java Platform]
 
 
 
*[http://elib.cs.berkeley.edu/src/satz/ SATZ--Adaptive Sentence Boundary Detector]
 
 
 
*[http://ixa.si.ehu.es/Ixa/resources/selprefs Selectional Preferences Extracted from Semcor for WordNet 1.6 Synsets (v 1.0)]
 
 
 
*[http://ilk.uvt.nl/~sabine/chunklink/ Software - The chunklink script, by Sabine Buchholz]
 
 
 
*[http://people.csail.mit.edu/people/mcollins/code.html Software and Data Sets for Collins Natural Language Parser]
 
 
 
*[http://senta.di.ubi.pt Software for the Extraction of N-ary Textual Associations (SENTA)]
 
 
 
*[http://www-a2k.is.tokushima-u.ac.jp/member/kita/NLP/nlp_tools.html Software Tools for NLP]
 
 
 
*[http://www-nlp.stanford.edu/software/lex-parser.shtml Stanford Parser]
 
 
 
*[http://start.csail.mit.edu/ START Natural Language Question Answering System]
 
 
 
*[http://www.lsi.upc.edu/%7Enlp/SVMTool/ SVMTool]
 
 
 
*[http://swesum.nada.kth.se/index-eng.html SweSum - Automatic Text Summarizer (with PRM)]
 
 
 
*[http://www.wagsoft.com/Coder/ Systemic Coder -- a Text Markup Tool (Version 4.5)]
 
 
 
*[http://www-2.cs.cmu.edu/~lenzo/t2p/ t2p: Text-to-Phoneme Converter Builder]
 
 
 
*[http://www.d.umn.edu/~tpederse/parallel.html Ted Pedersen - Tools for Parallel Text]
 
 
 
*[http://lsi.research.telcordia.com/ Telcordia Latent Semantic Indexing  Demo Machine]
 
 
 
*[http://www.tei-c.org/Software/index.html Text Encoding Initiative --Tools]
 
 
 
*[http://www.comp.lancs.ac.uk/computing/research/ucrel/claws/tagservice.html The CLAWS tagging service]
 
 
 
*[http://www.clsp.jhu.edu/ws99/projects/mt/toolkit/ The EGYPT Statistical Machine Translation Toolkit]
 
 
 
*[http://www.ims.uni-stuttgart.de/CorpusToolbox/ The IMS Corpus Toolbox Webpage]
 
 
 
*[http://jazzy.sourceforge.net/ The Java Open Source Spell Checker]
 
 
 
*[http://www.findingnames.net/ The Naming Company]
 
 
 
*[http://stp.ling.uu.se/~corpora/plug/pwa/ The PLUG Word Aligner - PWA]
 
 
 
*[http://timex2.mitre.org/taggers/timex2_taggers.html TIMEX2 Taggers]
 
 
 
*[http://www.coli.uni-sb.de/~thorsten/tnt/ TnT - Statistical Part-of-Speech Tagger]
 
 
 
*[ftp://ftp.ids-mannheim.de/kt/CSSCCb-4.0.tar.bz2 Tool for Identifying Duplicates in a Large Document Collection]
 
 
 
*[http://www.cs.columbia.edu/nlp/tools.html Tools developed at Columbia University (FUF, Surge, Crep, Segmenter, Verber, Xtract)]
 
 
 
*[http://www.torch.ch Torch3]
 
 
 
*[http://main.amu.edu.pl/~sipkadan/lingo.htm Turbo Lingo]
 
 
 
*[http://stp.ling.uu.se/cgi-bin/joerg/Uplug Uplug]
 
 
 
*[http://wordlist.sourceforge.net/varcon-readme VarCon (Variant Conversion Info)]
 
 
 
*[http://www.edict.com.hk/concordance/ Virtual Language Centre's Web Concordancer]
 
 
 
*[http://www.textanalysis.com/help/help.htm Visual Text - reference documentation]
 
 
 
*[http://www.textanalysis.com/ VisualText]
 
 
 
*[http://webglimpse.net/ Webglimpse]
 
 
 
*[http://search.cpan.org/dist/WordNet-SenseRelate Word-Net SenseRelate]
 
 
 
*[http://search.cpan.org/dist/WordNet-Similarity Word-Net Similarity]
 
 
 
*[http://sourceforge.net/projects/wordfreak Wordfreak]
 
 
 
*[http://www.cogsci.princeton.edu/~wn/ Wordnet]
 
 
 
*[http://www.d.umn.edu/~tpederse/similarity.html WordNet-Similarity PERL Module]
 
 
 
*[http://www.d.umn.edu/~tpederse/wsdshell.html WSD Shell]
 
 
 
*[http://www.xml-ces.org/ XCES: Corpus Encoding Standard for XML]
 
 
 
==TOOLS==
 
*[http://sslmit.unibo.it/~baroni/bootcat.html BootCaT Toolkit: Simple Utilities to Bootstrap Corpora and Terms from the Web]
 
 
 
*[http://chasen.aist-nara.ac.jp/hiki/ChaSen/ ChaSen]
 
 
 
*[http://nlp.cs.jhu.edu/~gsm/pd_demo Dendrogram Demo]
 
 
 
*[http://www.olst.umontreal.ca/dicoeng.html DiCo Lexical Database OLST]
 
 
 
*[http://www.wolfson.ox.ac.uk/~peet/eatshow.htm Edinburgh Associative Thesaurus]
 
 
 
*[http://www.smi.ucd.ie/hyppia/ HYPPIA]
 
 
 
*[http://xlex.uni-muenster.de/ M&uuml;nster Tagging Project]
 
 
 
*[http://nltk.sourceforge.net Natural Language Toolkit (NLTK)]
 
 
 
*[http://perso.wanadoo.fr/rosavram/ NooJ]
 
 
 
*[http://www.informatics.susx.ac.uk/research/nlp/rasp/ Robust Accurate Statistical Parsing (RASP)]
 
 
 
*[http://www.searchtools.com/ Search Tools for Web Sites and Intranets]
 
 
 
*[http://www.lsi.upc.edu/~surdeanu/swirl.html SwiRL Semantic Role Labeler]
 
 
 
*[http://view.byu.edu VIEW (Variation in English Words and Phrases)]
 
 
 
==UNCATEGORIZED==
 
*[http://www40.brinkster.com/dictionarium/index.html dictionarium]
 
 
 
*[http://www.mat.upm.es/~aries ARIES Natural Language Tools]
 
 
 
*[http://www.york.ac.uk/services/library/subjects/langint.htm Language and Linguistic Science information sources]
 
 
 
*[http://www.de.elra.research.ec.org/ The RELATOR language resources server]
 

Latest revision as of 09:45, 17 June 2015