|
|
| (155 intermediate revisions by 9 users not shown) |
| Line 1: |
Line 1: |
| ==A==
| | For other languages, see [[List of resources by language]]. |
|
| |
|
| *[http://www.cs.cmu.edu/afs/cs/academic/class/11761-s97/WWW/index.html 11-761 Language and Statistics, course at CMU, Spring 1997]
| | See also [[Multilingual resources]]. |
| *[ftp://ftp.cs.cornell.edu/pub/smart/time/ 1963 Time Magazine corpus]
| |
| *[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html 2000 NIST Speaker Recognition Evaluation Corpus]
| |
| *[http://www.linguistics.ucla.edu/nasslli04/index.html 3rd NASSLLI: North American Summer School in Logic, Language and Information]
| |
| *[http://www.linguistics.ucla.edu/nasslli04/index.html 3rd North American Summer School in Logic, Language and Information]
| |
| *[http://www.ldc.upenn.edu/exploration/survey.html A Survey of Open Language Archives]
| |
| *[http://www.coli.uni-sb.de/sfb378/negra-corpus/ A Syntactically Annotated Corpus of German Newspaper Texts]
| |
| *[http://www.acm.org/tois/ ACM Transactions on Information Systems]
| |
| *[http://www.cs.kun.nl/agfl/ AFGL Parser Generator]
| |
| *[http://www.dfki.de/lt/registry/generation/alfr21-x.html AL FRESCO Interactive System (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/multi/alfr21.html AL FRESCO Interactive System (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/multi/ale.html ALE -- Attribute Logic Engine (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/multi/alep-x.html ALEP (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/nlp_tools/alep.html ALEP (at the DFKI NLP archive)]
| |
| *[http://www.comp.leeds.ac.uk/amalgam/amalgam/amalghome.htm AMALGAM project ]
| |
| *[http://americannationalcorpus.org/FirstRelease/ AMERICAN NATIONAL CORPUS FIRST RELEASE ]
| |
| *[http://www.mat.upm.es/~aries ARIES Natural Language Tools]
| |
| *[http://www.dfki.de/lt/registry/parsers/avparser.html AV parser (at the DFKI NLP archive)]
| |
| *[http://www.awl-he.com/ Addison Wesley Longman higher education]
| |
| *[http://groups.yahoo.com/group/linguaffix/ Agglutination on the Basis of Corpus Information]
| |
| *[http://acdc.linguateca.pt/example_alignment.html Alignment of bilingual corpora performed with EasyAlign ]
| |
| *[http://odur.let.rug.nl/~vannoord/trees/ Alpino Treebank]
| |
| *[http://www.notam.uio.no/~hcholm/altlang/ Alternative dictionaries]
| |
| *[http://www.dfki.de/lt/registry/multi/alvey4.html Alvey Natural Language Tools (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/sem_and_prag/alvey4-x.html Alvey Natural Language Tools (at the DFKI NLP archive)]
| |
| *[http://www.elda.fr/catalogue/en/speech/S0115.html American English SpeechDat-Car]
| |
| *[http://www.elda.fr/catalogue/en/speech/S0115.html American English SpeechDat-Car]
| |
| *[http://www.cornelsen.de/international/ An Empirical Grammar of the English Verb System]
| |
| *[http://www.sultry.arts.usyd.edu.au/links/statnlp.html Annotated list of resources on statistical NLP and corpus-based CL]
| |
| *[http://www.ldc.upenn.edu/Catalog/LDC2001T55.html Arabic Newswire Part 1]
| |
| *[http://members.aol.com/gnhbos/ocr.htm Aramedia]
| |
| *[http://www.ltg.ed.ac.uk/~jo/interarbora/ Arbora Tree Delivery Service]
| |
| *[http://www.cambridge.org/ Architectures and Mechanisms for Language Processing]
| |
| *[http://www.a-i.com/ Artificial Intelligence NV (Ai)]
| |
| *[http://www.eprints.org/ Author/Institution Self-Archiving]
| |
| *[http://misshoover.si.umich.edu/~zzheng/sentence/ Automatic English Sentence Segmentation]
| |
| *[http://www.r.dl.itc.u-tokyo.ac.jp/~nakagawa/resource/termext/atr-e.html Automatic Term Extraction System]
| |
|
| |
|
| ==B==
| | <!-- Please keep this list in alphabetical order --> |
| | * [[Corpora for English|Corpora]] |
| | * [[Dictionaries (English)|Dictionaries]] |
| | * [[Generation grammars]] |
| | * [[Geographical words (English)|Geographical words]] |
| | * [[Knowledge collections and datasets (English)|Knowledge collections and datasets]] |
| | * [[Lexicons (English)|Lexicons]] |
| | * [[Subject specific resources (English)|Subject specific resources]] |
| | * [[Tools and Software for English|Tools and Software]] |
| | * [[Uncategorized resources]] - ''please help in categorizing'' |
|
| |
|
| *[http://www.dfki.de/lt/registry/multi/bim2.html BIM LOQUI (at the DFKI NLP archive)]
| | ==Other resource lists== |
| *[http://webdeptos.uma.es/filifa/personal/amoreno/indexer/ BNC Indexer] | | * [[Lists of resources|Other lists of resources]] |
| *[http://thetis.bl.uk/ BNC Online Service]
| |
| *[http://homepage.mac.com/bncweb/ BNCweb a web-based interface to the British National Corpus]
| |
| *[http://info.ox.ac.uk/bnc/ BRITISH NATIONAL CORPUS - WORLD EDITION]
| |
| *[http://lael.pucsp.br/corpora/ Bancos de dados e Ferramentas de an\`alise ]
| |
| *[http://www.dfki.de/lt/registry/data_sets/form_reduction-x.html Base form reduction and search form production (at the DFKI NLP archive)]
| |
| *[http://www.ai.mit.edu/~murphyk/Software/BNT/bnt.html Bayes Net Toolbox for Matlab]
| |
| *[http://www.ai.mit.edu/~murphyk/Software/BNT/bnt.html Bayes Net Toolbox for Matlab]
| |
| *[http://bndev.sourceforge.net/ Bayesian Network tools in Java (BNJ) ]
| |
| *[http://bndev.sourceforge.net/ Bayesian Network tools in Java (BNJ) ]
| |
| *[http://clwww.essex.ac.uk/search/ Bibliographic Search Page, Univ. of Essex]
| |
| *[http://www.uni-frankfurt.de/~ifb/bibabfrage.html Bibliography for Phonetics/Speech Technology]
| |
| *[http://www.cs.berkeley.edu/~russell/aima.html Bibliography to the book "Artificial Intelligence: A Modern Approach by Russell and Norvig ]
| |
| *[http://www.d.umn.edu/~tpederse/code.html Bigram Statistics Package]
| |
| *[http://www.elra.info Bilingual Dictionary French Arabic]
| |
| *[http://www.cambridge.org/ Bilingual Speech: A Typology of Code-Mixing]
| |
| *[http://devoted.to/corpora Bookmarks for Corpus-based Linguists ]
| |
| *[http://www.academicpress.com/b\&l/ Brain and Language]
| |
| *[http://www.brainhat.com/ Brainhat Natural Language Processing]
| |
| *[http://www.cs.jhu.edu/~brill/RBT1_14.tar.Z Brill Tagger (Supervised, Trainable)]
| |
| *[http://www.dfki.de/lt/registry/data_sets/beep.html British English Example Pronunciations (BEEP) (at the DFKI NLP archive)]
| |
|
| |
|
| ==C== | | ==Additional information== |
| | <!-- Please keep this list in alphabetical order --> |
|
| |
|
| *[http://www.dfki.de/lt/registry/multi/cat2.html CAT2 (at the DFKI NLP archive)] | | * [[Anthology Statistics]] |
| *[http://www.dfki.de/lt/registry/generation/cat2-x.html CAT2(at the DFKI NLP archive)]
| | * [[Bibliographies]] |
| *[http://www.kun.nl/celex CELEX - The Dutch Center for Lexical Information]
| | * [[Blogs]] |
| *[http://lael.pucsp.br/corpora/segmentador/ CEPRIL - Portugese Segmenter] | | * [[Books]] |
| *[http://lael.pucsp.br/corpora/alinhador/ CEPRIL aligner ]
| | * [[Conferences]] |
| *[http://www.dfki.de/lt/registry/parsers/cfg.html CFG parser (at the DFKI NLP archive)]
| | * [[Courses]] |
| *[http://www.dfki.de/lt/registry/generation/charon-x.html CHARON (at the DFKI NLP archive)] | | * [[Journals]] |
| *[http://www.dfki.de/lt/registry/parsers/charon.html CHARON (at the DFKI NLP archive)]
| | * [[Newsgroups, mailing lists|Newsgroups and mailing lists]] |
| *[http://www.bultreebank.org/clark/index.html CLaRK System ]
| | * [[Papers]] |
| *[http://www.bultreebank.org/clark CLaRK System] | |
| *[http://www.speech.cs.cmu.edu/sphinx/ CMU Sphinx Group: Open Source Speech Recognition Engines ]
| |
| *[http://www.dfki.de/lt/registry/nlp_tools/cognate.html COGNATE (at the DFKI NLP archive)]
| |
| *[http://www.linguateca.pt/COMPARA/ COMPARA corpus] | |
| *[http://www.dfki.de/lt/registry/nlp_tools/compulexis2.html COMPULEXIS (at the DFKI NLP archive)]
| |
| *[http://www.academicpress.com/csl/ COMPUTER SPEECH AND LANGUAGE ]
| |
| *[http://www.copernic.com/ COPERNIC 2000] | |
| *[http://www.corpusdelespanol.org/ CORPUS DEL ESPANOL]
| |
| *[http://corpora.ids-mannheim.de/~cosmas/ COSMAS II ]
| |
| *[http://search.cpan.org/dist/Lingua-EN-Sentence/ CPAN Lingua EN Sentence Splitter] | |
| *[http://search.cpan.org/dist/Lingua-HE-Sentence/ CPAN Lingua HE Sentence Splitter]
| |
| *[http://search.cpan.org/dist/SuffixTree/ CPAN Suffix Tree Module]
| |
| *[http://corpus.rae.es/creanet.html CREA] | |
| *[http://corpus.rae.es/creanet.html CREA]
| |
| *[http://courses.cs.cornell.edu/cs674/2000SP/ CS674: Natural Language Processing (Cornell U., Spring 2000)]
| |
| *[http://lingo.stanford.edu/ CSLI LinGO Lab (Stanford)]
| |
| *[http://search.cpan.org/~tgrose/HTML-Summary-0.017/ CSPAN Sentence Splitter]
| |
| *[http://www.dfki.de/lt/registry/formalisms/cuf.html CUF (at the DFKI NLP archive)]
| |
| *[http://dictionary.cambridge.org/researchers.htm Cambridge Learner Dictionary]
| |
| *[http://www.canoo.net/ Canoo.net - German Dictionaries and Grammars]
| |
| *[http://www.cascadilla.com/ Cascadilla Press]
| |
| *[http://www.cdc.gov/ncidod/sars/languages.htm Centre for Disease Control - Chinese, French, Japanese, Spanish info on SARS] | |
| *[http://www.chilibot.net/ Chilibot: NLP based miner for gene/protein/keyword relationships]
| |
| *[http://www.chilibot.net/ Chilibot: NLP based miner for gene/protein/keyword relationships]
| |
| *[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/allen/0.html Code from James Allen's "Natural Language Understanding" (code at CMU) ]
| |
| *[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/bookcode/nlp_pp/0.html Code from Michael Covington's "NLP for Prolog Programmers" (code at CMU)]
| |
| *[http://www.elsevier.nl/freeinfo/mathcompcog/505626/505626.htm Cognition, a journal from Elsevier Science]
| |
| *[http://www.dcs.gla.ac.uk/idom/ir_resources/ Collections of texts and corpora]
| |
| *[http://www.collectivelanguage.com/demo.html Collective (Chaotic - Emergent) Language]
| |
| *[ftp://cs.nyu.edu/pub/html/comlex.html/README.html Comlex Syntax (Syntactic Dictionary of English)]
| |
| *[http://www.ai.mit.edu/projects/iiip/doc/cl-http/home-page.html Common Lisp Hypermedia Server]
| |
| *[http://www.cpan.org/ Comprehensive Perl Archive Network]
| |
| *[http://www.cs.brandeis.edu/~llc/cs114/ Computational Linguistics, James Pustejovsky, Brandeis University]
| |
| *[http://mitpress.mit.edu/COLI Computational Linguistics]
| |
| *[http://www.academicpress.com/csl/ Computer Speech and Language]
| |
| *[http://www.cstit.cl.cam.ac.uk/gateway/ Computer Speech, Text and Internet Technology]
| |
| *[https://sourceforge.net/projects/concollate/ Concollate]
| |
| *[http://www.dfki.de/lt/registry/multi/cfs.html Context Feature Structure System (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/sem_and_prag/cfs-x.html Context Feature Structure System (at the DFKI NLP archive)]
| |
| *[http://borel.slu.edu/crubadan/ Corpus building for minority languages]
| |
| *[http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular]
| |
| *[http://www.lllf.uam.es/~fmarcos/informes/corpus/corpulee.html Corpus de referencia de la lengua Espanola contemporanea: corpus oral peninsular]
| |
| *[http://www.corpusdelespanol.org/ Corpus del Espanol]
| |
| *[http://www.corpusdelespanol.org/ Corpus del Espanol]
| |
| *[http://www.athel.com/corpdes.html Corpus of Spoken Professional English]
| |
| *[http://www.hf.uio.no/easteur-orient/bulg/mat/ Corpus of spoken Bulgarian]
| |
| *[http://www.ling.lancs.ac.uk/monkey/ihe/linguistics/contents.htm Course in Corpus Linguistics, Tony McEnery & Andrew Wilson]
| |
| *[ftp://ftp.cs.cornell.edu/pub/smart/cran/ Cranfield collection]
| |
| *[http://ucnk.ff.cuni.cz/english/index.html Czech National Corpus]
| |
|
| |
|
| ==D==
| | [[Category:Resources by language|English]] |
| | |
| *[http://www.dfki.de/lt/registry/nlp_tools/dcgworkbench.html DCG workbench (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/generation/dectalk-x.html DECtalk (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/parsers/disco_chart.html DISCO chart parser (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/nlp_tools/dito.html DITO -- DIagnostic TOol for german syntax (at the DFKI NLP archive)]
| |
| *[http://www.dtreg.com/ DTREG decision tree generator]
| |
| *[http://www.cis.upenn.edu/~dbikel/#stat-parser Dan Bikel's Parser]
| |
| *[http://korpus.dsl.dk/korpus2000/indgang.php Danish news corpus]
| |
| *[http://www.dataharmony.com/ Data Harmony, Document Management Software]
| |
| *[http://www.debian.org/international/ Debian free software community]
| |
| *[http://delphesintl.com/ Delphes Technologies International, natural language processing. ]
| |
| *[http://delphesintl.com/ Delphes Technologies International]
| |
| *[http://www.cs.ualberta.ca/~lindek/demos.htm Demos of dependency database, parser, and other tools]
| |
| *[http://www.cs.ualberta.ca/~lindek/demos.htm Demos, University of Alberta, Canada]
| |
| *[http://www-rcf.usc.edu/~billmann/diversity/DDivers-site.htm Dialogue Diversity Corpus ]
| |
| *[http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html#Spanish-dicts Dictionaries for International Ispell]
| |
| *[http://www.dfki.de/lt/registry/nlp_tools/dimap.html Dictionary Maintenance Programs (at the DFKI NLP archive)]
| |
| *[http://www.dfki.de/lt/registry/nlp_tools/dict21.html Dictionary Maintenance Utilities (at the DFKI NLP archive)]
| |
| *[http://www.bucknell.edu/~rbeard/diction.html Dictionary site, Bucknell University]
| |
| | |
| ==E==
| |
| | |
| ==F==
| |
| | |
| ==G==
| |
| | |
| ==H==
| |
| | |
| ==I==
| |
| | |
| ==J==
| |
| | |
| ==K==
| |
| | |
| ==L==
| |
| | |
| ==M==
| |
| | |
| ==N==
| |
| | |
| ==0==
| |
| | |
| ==P==
| |
| | |
| ==Q==
| |
| | |
| ==R==
| |
| | |
| ==S==
| |
| | |
| ==T==
| |
| | |
| ==U==
| |
| | |
| ==V==
| |
| | |
| ==W==
| |
| | |
| *[http://www.comp.lancs.ac.uk/ucrel/bncfreq/flists.html "Word Frequencies in Written and Spoken English: based on the British National Corpus."]
| |
| | |
| ==X==
| |
| | |
| ==Y==
| |
| | |
| ==Z==
| |