Difference between revisions of "Uncategorized resources"

From ACL Wiki
Jump to navigation Jump to search
m (Change *[http://nltk.sourceforge.net Natural Language Toolkit (NLTK)] to *[http://nltk.net Natural Language Toolkit (NLTK)], move to Multilingual resources, Collaborative projects.)
 
(16 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
<div class="usermessage">
 
<div class="usermessage">
 
* Please ''do not add anything new to this list''; we would like to eventually eliminate this list.
 
* Please ''do not add anything new to this list''; we would like to eventually eliminate this list.
* Please add new items to the [[List of resources by language]] where appropriate.
+
* Please add new items to the [[List of resources by language]].
* Please help us move items below into the [[List of resources by language]] or one of the categories in [[Resources]]; add new categories if needed.
+
* Please help us move items below into the [[List of resources by language]].
 +
* Missing items are denoted with †.
 
</div>
 
</div>
  
*[http://www.ldc.upenn.edu/exploration/survey.html A Survey of Open Language Archives]  
+
*[http://www.siggen.org/resources/ ACL SIGGEN Resources Wiki]
  
*[http://www.siggen.org/resources/ ACL SIGGEN Resources Wiki]  
+
*[http://www.cs.kun.nl/agfl/ AFGL Parser Generator]† 
  
*[http://www.cs.kun.nl/agfl/ AFGL Parser Generator]  
+
*[http://www.a-i.com/ Artificial Intelligence NV (Ai)]
  
*[http://www.let.rug.nl/~vannoord/alp/ Algorithms for Linguistic Processing]
+
*[http://www.eprints.org/ Author/Institution Self-Archiving]
 
 
*[http://www.a-i.com/ Artificial Intelligence NV (Ai)]
 
 
 
*[http://www.eprints.org/ Author/Institution Self-Archiving]  
 
 
 
*[http://www.chinesecomputing.com Chinese Computing]
 
  
 
*[http://www.copernic.com/ COPERNIC 2000]  
 
*[http://www.copernic.com/ COPERNIC 2000]  
  
*[http://www.linguateca.pt/corpografo CorpÛgrafo]
+
*[http://www.cis.upenn.edu/~dbikel/#stat-parser Dan Bikel's Parser]
 
 
*[http://www.cis.upenn.edu/~dbikel/#stat-parser Dan Bikel's Parser]  
 
  
 
*[http://java.sun.com/docs/books/tutorial/i18n/text/boundaryintro.html Detecting Text Boundaries]  
 
*[http://java.sun.com/docs/books/tutorial/i18n/text/boundaryintro.html Detecting Text Boundaries]  
  
*[http://www.catchword.com/era Educational Research Abstracts]  
+
*[http://www.catchword.com/era Educational Research Abstracts]
  
*[http://emotion-research.net/wiki/Databases Emotional Databases]  
+
*[http://emotion-research.net/wiki/Databases Emotional Databases]† 
  
 
*[http://lingo.stanford.edu/erg.html English Resource Grammar]  
 
*[http://lingo.stanford.edu/erg.html English Resource Grammar]  
Line 35: Line 28:
 
*[http://www.cjk.org/cjk/samples/chincomc.htm English-Chinese Chinese-English Dictionary of Computer Terms]  
 
*[http://www.cjk.org/cjk/samples/chincomc.htm English-Chinese Chinese-English Dictionary of Computer Terms]  
  
*[http://www.freelangonline.com/ Freelangonline - many on-line dictionaries + more]  
+
*[http://www.freelangonline.com/ Freelangonline - many on-line dictionaries + more]
  
*[http://dmoz.org/Computers/Software/Information_Retrieval/ Information Retrieval]  
+
*[http://dmoz.org/Computers/Software/Information_Retrieval/ Information Retrieval]† 
  
 
*[http://ir.dcs.gla.ac.uk/resources.html IR resources]  
 
*[http://ir.dcs.gla.ac.uk/resources.html IR resources]  
 
*[http://odin.prohosting.com/hkkim/cgi-bin/kaeps/ Korean Accented English Pronunciation Simulator]
 
  
 
*[http://www.kwicfinder.com/KWiCFinder.html KwicFinder Web Concordancer and Online Research Tool]  
 
*[http://www.kwicfinder.com/KWiCFinder.html KwicFinder Web Concordancer and Online Research Tool]  
  
*[http://www.academiaisla.com/acadi/gen/0_en.html LANGUAGE LINKS]  
+
*[http://www.academiaisla.com/acadi/gen/0_en.html LANGUAGE LINKS]
  
*[http://www.link.cs.cmu.edu/lexfn/ Lexical FreeNet]  
+
*[http://www.link.cs.cmu.edu/lexfn/ Lexical FreeNet]
  
*[http://www-lfg.stanford.edu/lfg/ilfga LFG Database: List of Names]  
+
*[http://www-lfg.stanford.edu/lfg/ilfga LFG Database: List of Names]
  
*[http://lse.umiacs.umd.edu/ Linguist's Search Engine]  
+
*[http://lse.umiacs.umd.edu/ Linguist's Search Engine]
  
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/ Linguistic Interpretation of a German Corpus]  
+
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/ Linguistic Interpretation of a German Corpus]
  
*[http://www.link.cs.cmu.edu/ LINK GRAMMAR PARSER]  
+
*[http://www.link.cs.cmu.edu/ LINK GRAMMAR PARSER]† 
  
*[http://www.eturner.net/linkgrammar-wn/ LinkGrammar-WN project]  
+
*[http://www.eturner.net/linkgrammar-wn/ LinkGrammar-WN project]
  
*[http://www.psy.uwa.edu.au/MRCDataBase/uwa_mrc.htm MRC Psycholinguistic Database]  
+
*[http://www.psy.uwa.edu.au/MRCDataBase/uwa_mrc.htm MRC Psycholinguistic Database]
  
*[http://nl.ijs.si/ME/V3/ Multext East Resources, Version 3]  
+
*[http://nl.ijs.si/ME/V3/ Multext East Resources, Version 3]
  
*[http://multiwordnet.itc.it/english/home.php MultiWordNet]  
+
*[http://multiwordnet.itc.it/english/home.php MultiWordNet]
  
*[http://www.comp.nus.edu.sg/~rpnlpir/ Natural Language Processing / Information Retrieval Software Repository]  
+
*[http://www.comp.nus.edu.sg/~rpnlpir/ Natural Language Processing / Information Retrieval Software Repository]
  
*[http://nlsh.sourceforge.net/ NLSH: Natural Language Shell]  
+
*[http://nlsh.sourceforge.net/ NLSH: Natural Language Shell]
  
 
*[http://www.irisa.fr/Omphalos/ Omphalos Context-Free Language Learning Competition]  
 
*[http://www.irisa.fr/Omphalos/ Omphalos Context-Free Language Learning Competition]  
  
*[http://ysomeya.hp.infoseek.co.jp/ Online Business Letter Corpus KWIC Concordancer]  
+
*[http://ysomeya.hp.infoseek.co.jp/ Online Business Letter Corpus KWIC Concordancer]
  
 
*[http://www.grsampson.net/RLeafAnc.html Parse Evaluation]  
 
*[http://www.grsampson.net/RLeafAnc.html Parse Evaluation]  
  
*[http://pie.usna.edu Phrases in English and the British National Corpus]  
+
*[http://pie.usna.edu Phrases in English and the British National Corpus]
  
*[http://pygoogle.sourceforge.net/ PyGoogle: A Python Interface to the Google API]  
+
*[http://pygoogle.sourceforge.net/ PyGoogle: A Python Interface to the Google API]
  
*[http://www.ai.uga.edu/mc/PythonForNewbieLinguists.html Python Programming Tutorial]  
+
*[http://www.ai.uga.edu/mc/PythonForNewbieLinguists.html Python Programming Tutorial]
  
 
*[http://corpus.leeds.ac.uk/internet.html Query to Internet Corpora]  
 
*[http://corpus.leeds.ac.uk/internet.html Query to Internet Corpora]  
  
*[http://www.lexmasterclass.com/exercises/regex/index.html Regular Expression Exercises]  
+
*[http://www.lexmasterclass.com/exercises/regex/index.html Regular Expression Exercises]
  
*[http://www.sims.berkeley.edu/~hjiang1/eng_chi_resources.html Resources for English-Chinese CLIR]  
+
*[http://www.sims.berkeley.edu/~hjiang1/eng_chi_resources.html Resources for English-Chinese CLIR]
  
 
*[http://www.cs.technion.ac.il/~gabr/resources/resources.html Resources for Text, Speech and Language Processing]  
 
*[http://www.cs.technion.ac.il/~gabr/resources/resources.html Resources for Text, Speech and Language Processing]  
Line 91: Line 82:
 
*[http://www.sfu.ca/rst/ Rhetorical Structure Theory (RST)]  
 
*[http://www.sfu.ca/rst/ Rhetorical Structure Theory (RST)]  
  
*[http://www.philol.msu.ru/rus/galya-1 Russian Phonetics on the Web]  
+
*[http://www.clres.com/SensSemRoles.html Senseval-3 Task: Automatic Labeling of Semantic Roles]
  
*[http://www.clres.com/SensSemRoles.html Senseval-3 Task: Automatic Labeling of Semantic Roles]  
+
*[http://www.clres.com/SensWNDisamb.html Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses]
  
*[http://www.clres.com/SensWNDisamb.html Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses]
+
*[http://www.cs.unt.edu/~rada/wa/ Sentence Alignment and Word Alignment: Projects, Papers, Evaluation, Etc.]
 
 
*[http://www.cs.unt.edu/~rada/wa/ Sentence Alignment and Word Alignment: Projects, Papers, Evaluation, Etc.]  
 
  
 
*[http://ontoweb-lt.dfki.de/knowledge_index.htm SIG5 OntoWeb]  
 
*[http://ontoweb-lt.dfki.de/knowledge_index.htm SIG5 OntoWeb]  
  
*[http://sara.natcorp.ox.ac.uk/lookup.html Simple Search of BNC-World]  
+
*[http://sara.natcorp.ox.ac.uk/lookup.html Simple Search of BNC-World]
  
 
*[http://www.sigsem.org/ Special Interest Group on Computational Semantics]  
 
*[http://www.sigsem.org/ Special Interest Group on Computational Semantics]  
  
*[http://www.grsampson.net/RSue.html SUSANNE Analytic Scheme]
+
*[http://www.telemakus.net/ Telemakus: Mining and Mapping Research Findings to Promote Knowledge Discovery]
 
 
*[http://www.telemakus.net/ Telemakus: Mining and Mapping Research Findings to Promote Knowledge Discovery]  
 
  
 
*[http://www.clef-campaign.org/ The Cross-Language Evaluation Forum]  
 
*[http://www.clef-campaign.org/ The Cross-Language Evaluation Forum]  
Line 149: Line 136:
 
*[http://www.gmi.org/wlms/ World Language Mapping System]
 
*[http://www.gmi.org/wlms/ World Language Mapping System]
  
*[http://www.comp.leeds.ac.uk/eric/iwcs.ps A Domain-Independent Semantic Tagger for the Study of Meaning Associations in English Text]
+
*[http://www.comp.leeds.ac.uk/eric/iwcs.ps A Domain-Independent Semantic Tagger for the Study of Meaning Associations in English Text]
  
*[http://www3.interscience.wiley.com/cgi-bin/abstract/104525215/ABSTRACT Automatic Construction of English/Chinese Parallel Corpora]
+
*[http://www3.interscience.wiley.com/cgi-bin/abstract/104525215/ABSTRACT Automatic Construction of English/Chinese Parallel Corpora]
  
 
*[ftp://ftp.icsi.berkeley.edu/pub/techreports/ Berkeley - Technical Reports]
 
*[ftp://ftp.icsi.berkeley.edu/pub/techreports/ Berkeley - Technical Reports]
Line 190: Line 177:
  
 
*[http://xlex.uni-muenster.de/ M&uuml;nster Tagging Project]
 
*[http://xlex.uni-muenster.de/ M&uuml;nster Tagging Project]
 
*[http://nltk.sourceforge.net Natural Language Toolkit (NLTK)]
 
  
 
*[http://perso.wanadoo.fr/rosavram/ NooJ]
 
*[http://perso.wanadoo.fr/rosavram/ NooJ]
Line 205: Line 190:
 
*[http://www40.brinkster.com/dictionarium/index.html dictionarium]
 
*[http://www40.brinkster.com/dictionarium/index.html dictionarium]
  
*[http://www.mat.upm.es/~aries ARIES Natural Language Tools]
+
*[http://www.mat.upm.es/~aries ARIES Natural Language Tools]
  
 
*[http://www.york.ac.uk/services/library/subjects/langint.htm Language and Linguistic Science information sources]  
 
*[http://www.york.ac.uk/services/library/subjects/langint.htm Language and Linguistic Science information sources]  
Line 212: Line 197:
  
 
*[ftp://parcftp.xerox.com/pub/ Xerox PARC FTP site.]
 
*[ftp://parcftp.xerox.com/pub/ Xerox PARC FTP site.]
 +
 +
*[http://actarus.atilf.fr/morphalou/ Lexique Morphalou]
 +
 +
*[http://orleans.lti.cs.cmu.edu/Reap/ REAP Project: Reader-Specific Lexical Practice for Improved Reading Comprehension]
 +
 +
*[http://people.cs.uchicago.edu/~dinoj/icsi97syl.disc.gz Syllable-Level Conversational English Transcriptions]
 +
 +
*[http://www.sil.org/mexico/pub/vimsa.htm The Mariano Silva y Aceves Series]
 +
 +
*[http://www.unine.ch/info/clef/ UniNE stopword list for Portuguese]
 +
 +
*[http://www.valencianlanguage.com/ Valencianlanguage.com]
 +
 +
* [http://www.answerbus.com AnswerBus]
 +
 +
* [http://www.axontologic.com Axontologic]†
 +
 +
* [http://www.bionlp.org Natural language processing of biology text]
 +
 +
* [http://textalyser.net Text analysis]
 +
 +
* [http://www.20000-names.com/ 2000 names from around the world]
 +
 +
* [http://www.facstaff.bucknell.edu/rbeard/diction.html A web of online dictionaries]
 +
 +
* [http://abu.cnam.fr/ ABU : la Biblioth\`eque Universelle]
 +
 +
* [http://www.comp.leeds.ac.uk/amalgam/tagsets/tagmenu.html AMALGAM English PoS tagsets]†
 +
 +
* [http://www.comp.leeds.ac.uk/amalgam/amalgam/multi-tagged.html AMALGAM Multi-Tagged Corpus]†
 +
 +
* [http://www.comp.leeds.ac.uk/amalgam/amalgam/multi-parsed.html AMALGAM Multi-Treebank]†
 +
 +
* [http://www.comp.leeds.ac.uk/amalgam/amalgam/amalgtag3.html AMALGAM email Part-of-Speech tagger]†
 +
 +
* [http://www.lpl.univ-aix.fr/projects/arcade/ ARCADE: Evaluation of text alignment tools]†
 +
 +
* [http://www.aspseek.org/ ASPSEEK search engine]†
 +
 +
* [http://www.ucc.ie/info/net/acronyms/acro.html Acronym Server]†
 +
 +
* [http://tcc.itc.it/people/lavelli/ Alberto Lavelli, ITC]†
 +
 +
* [http://www.wtsn.binghamton.edu/ANS/ American Name Society]†
 +
 +
* [http://www.mat.upm.es/~aries/description.html Aries natural language tools]†
 +
 +
* [http://linguistlist.org/~ask-ling Ask a Linguist]†
 +
 +
* [http://www.webspawner.com/users/samplex/ Build a program immediately from data processing examples: NLP]
 +
 +
* [http://www.link.cs.cmu.edu/link/submit-sentence-4.html CMU Sentence Parser]
 +
 +
* [http://corpora.ids-mannheim.de/~cosmas COSMAS - Corpus Storage, Maintenance and Access System]
 +
 +
* [http://www.uni-paderborn.de/fachbereich/AG/agmadh/WWW/english/scripts.html Collection of Lecture Notes, Surveys, and Papers]
 +
 +
* [http://searchenginewatch.internet.com/facts/searches.html Corpora of user queries]
 +
 +
* [http://www.cs.vassar.edu/CES/ Corpus Enconding Standard]
 +
 +
* [http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/ Corpus Workbench]
 +
 +
* [http://www.robotwisdom.com/ai/cycresources.html Cyc resources]
 +
 +
* [http://www.ict.usc.edu/~traum David Traum, USC]
 +
 +
* [http://www.elcom.de/ ELCOM - Tuersprechanlagen, Produzent fuer Tuersprechanlagen... Hauskommunikation fuer Ihre Sicherheit.]
 +
 +
* [http://www.comp.leeds.ac.uk/eric/ Eric Atwell, University of Leeds]
 +
 +
* [http://www.pg.gda.pl/~jandac/fsa.html Finite-State Utilities for morphological analysis (known and unknown words), spelling correction, perfect hashing, diacritics restoration]
 +
 +
* [http://www.xrce.xerox.com/cgi-bin/mltt/nodict-fsc-teach-int French conjugation test]
 +
 +
* [http://www.cs.utk.edu/~cs460.is&r/glossary.html Glossary for the Book Information Storage and Retrieval by R. Korfhage]
 +
 +
* [http://www.kinet-tv.ne.jp/~sat/data/publications/2001/c36.html Hatayama parser]
 +
 +
* [http://www.cs.ust.hk Human Language Technology Center, UST, Hong Kong]
 +
 +
* [http://www.ihear.com/ IHear]
 +
 +
* [http://www.ribbitsoft.com/ist/ist.html Information Search and Transformation Technologies]
 +
 +
* [http://www.illc.uva.nl/~christof Institute for Logic, Language and Computation (ILLC) - University of Amsterdam]
 +
 +
* [http://www.germanistik.fu-berlin.de/il/ Integrational Linguistics]
 +
 +
* [http://www.xs4all.nl/~bsarempt/linguistics/index.html Kura]
 +
 +
* [http://www.lrec-conf.org/lrec2004/doc/ws/CPSLC.pdf LREC 2004 Workshop: Compiling and processing spoken language corpora]
 +
 +
* [http://www.comp.lancs.ac.uk/computing/research/ucrel/claws/trial.html Lancaster CLAWS PoS tagger]
 +
 +
* [http://www.asel.udel.edu/natlang/nlp/lrd.html Language Representation Database]
 +
 +
* [http://lsa.colorado.edu/content.html Latent Semantic Analysis]
 +
 +
* [http://www.blackwellpublishers.co.uk/labs Linguistics Abstracts]
 +
 +
* [http://www.phil.uni-passau.de/linguistik/linguistik_urls/ Linguistics Links Database, University of Passau]
 +
 +
* [http://www.surrey.ac.uk/ELI/thesis.html MA dissertations at Univ. of Surrey, English Language Institute]
 +
 +
* [http://www.ai.mit.edu/projects/infolab MIT AI Lab NLP Demos]
 +
 +
* [http://www.cs.cmu.edu/~aberger/maxent.html MaxEnt and Exponential Models, Tutorials by Adam Berger, CMU]
 +
 +
* [http://www.scn.org/~mentifex/ Mentifex AI]
 +
 +
* [http://meilleursprenoms.com/ Most frequent French names]
 +
 +
* [http://www.cee.hw.ac.uk/~alison/nl/details.html Natural Language Processing Course]
 +
 +
* [http://nlpers.blogspot.com/ Natural Language Processing Research Blog]
 +
 +
* [http://www.elsnet.org/publications/survey/ Natural Language Processing in Eastern Europe]
 +
 +
* [http://www.cs.dal.ca/~vlado/csci6509/ Natural Language Processing, CSCI6509 (Dalhousie University)]
 +
 +
* [http://www.canoo.net/ Ontology-Based Modelling in the Humanities]
 +
 +
* [http://sunsite.unc.edu/patents/intropat.html Patent Information (includes patents in IR, Acoustics...)]
 +
 +
* [http://www.cis.upenn.edu/~treebank/home.html Penn Treebank Corpus]
 +
 +
* [http://pie.usna.edu/ Phrases in English]
 +
 +
* [http://www.cs.monash.edu.au/cgi-bin/postmodern Postmodernist Journal Paper Generator]
 +
 +
* [http://acl.ldc.upenn.edu/W/W05/ Proceedings of ENLG-05]
 +
 +
* [http://www.aus.xanadu.com/ Project Xanadu]
 +
 +
* [http://www.riexinger.com/ RIEXINGER Tuerenwerke]
 +
 +
* [http://www.ualberta.ca/~dmiall/reading/index.htm Reader Response: Empirical Studies of Literary Reading]
 +
 +
* [http://www.iol.ie/~mazzoldi/lang/index.html Resources for translators]
 +
 +
* [http://www.lpl.univ-aix.fr/projects/romanseval/ Romanseval : Word sense disambiguation for Romance languages]
 +
 +
* [http://www.dfki.de/lt/diss/ Saarbruecken Dissertations in Computational Linguistics and Language Technology]
 +
 +
* [http://www.ccs.neu.edu/home/futrelle/bionlp/search.html Search Bionlp.org]
 +
 +
* [http://www.itri.brighton.ac.uk/events/senseval/ Senseval 2 : Evaluating Word Sense Disambiguation Systems]
 +
 +
* [http://www.startext.co.uk/ Startext: small, private language & document engineering practice]
 +
 +
* [http://www.cs.cornell.edu/courses/cs775/2001sp/default.html Statistical Natural Language Processing: Models and Methods (Cornell U. COMS 775)]
 +
 +
* [http://www.w3.org/TR/2001/WD-ngram-spec-20010103/ Stochastic Language Models (N-Gram) Specification W3C Working Draft (January 3, 2001)]
 +
 +
* [http://crl.nmsu.edu/summerschool Summer School at NMSU]
 +
 +
* [http://www.let.rug.nl/~vannoord/TextCat/Demo/ TextCat Language Guesser]
 +
 +
* [http://www.mitre.org/technology/alembic-workbench/ The Alembic Workbench]
 +
 +
* [http://egnatia.ee.auth.gr/~yhat/yiannis The LexiGraf web page]
 +
 +
* [http://linguistlist.org/ The Linguist List]
 +
 +
* [http://www.sil.org/computing/shoebox/ The Linguist's Shoebox]
 +
 +
* [http://www.logos.it/magazine/ The Magazine for the Language Industries]
 +
 +
* [http://www.coli.uni-sb.de/sfb378/negra-corpus/negra-corpus.html The Negra Corpus - German Syntax annotated]
 +
 +
* [http://www.csun.edu/~vcoao0el/de361/de361s53_folder/english.html The Trouble with English Spelling]
 +
 +
* [http://www.webcorp.org.uk/ The WebCorp project]
 +
 +
* [http://timexportal.info TimexPortal - Temporal Expression Recognition and Normalisation]
 +
 +
* [http://www.comp.lancs.ac.uk/ucrel/tech_papers.html UCREL Technical Papers]
 +
 +
* [http://www.czyborra.com/unifont/ Unifont, the Unicode font]
 +
 +
* [http://rreck.sealsoft.com/ Web based language analysis, keyword searching of project Gutenburg texts]
 +
 +
* [http://rreck.sealsoft.com/cgi-bin/apps/wordfind/wordfind.pl Word Finder - uses Project Gutenberg corpus]
 +
 +
* [http://rreck.sealsoft.com/cgi-bin/apps/wordfind/freqgen.pl Word Frequency Analyzer - uses Project Gutenberg corpus]
 +
 +
* [http://wndomains.itc.it/ Word Sense Disambiguation and Categorization at IRST]
 +
 +
* [http://www.liv.ac.uk/~ms2928/wordsmith/index.htm WordSmith Tools]
 +
 +
* [http://linkage.rockefeller.edu/wli/zipf/ Zipf's Law]
 +
 +
* [http://www.entmp.org/linguistics/phonGIF/ phonGIF - bitpapped IPA characters (based on the wsuipa METAFONT)]
 +
 +
*[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html 2000 NIST Speaker Recognition Evaluation Corpus]
 +
 +
*[http://ixa.si.ehu.es/Ixa/resources/sensecorpus A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)]
 +
 +
*[http://odur.let.rug.nl/~vannoord/trees/ Alpino Treebank]†
 +
 +
*[http://www.aot.ru/search1.html AOT]†
 +
 +
*[http://pioneer.chula.ac.th/~awirote/ling/corpuslst.htm Corpus Resources (Chulalongkorn University, Thailand)]
 +
 +
*[ftp://ftp.cs.cornell.edu/pub/smart/cran/ Cranfield collection]
 +
 +
*[http://corpus.rae.es/creanet.html CREA]
 +
 +
*[http://www.eat.rl.ac.uk/ Edinburgh Associative Thesaurus (EAT)]
 +
 +
*[http://www.hum.uva.nl/~ewn EuroWordNet]
 +
 +
*[http://rali.iro.umontreal.ca/ Hansards Corpus - Searchable]
 +
 +
*[http://www.hcrc.ed.ac.uk/maptask/ HCRC Map Task Corpus XML annotations]
 +
 +
*[http://nats-www.informatik.uni-hamburg.de/~ingo/icopost/ ICOPOST]
 +
 +
*[http://www.ims.uni-stuttgart.de/projekte/TC.html IMS Corpus Toolbox, Univ. of Stuttgart]
 +
 +
*[http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/ IMS Corpus Workbench (CWB)]
 +
 +
*[http://cecl.fltr.ucl.ac.be/Cecl-Projects/Icle/icle.htm International Corpus of Learner English]
 +
 +
*[http://www.ipds.uni-kiel.de/links/datenmaterial.en.html Kiel University's Institute on Phonetics and Speech Procesing]
 +
 +
*[http://www.nilc.icmc.usp.br/lacioweb Lacio Web Corpora]
 +
 +
*[http://www.vuw.ac.nz/llc/ LANGUAGE LEARNING CENTER - ACADEMIC CORPUS]
 +
 +
*[http://www.bmanuel.org/clr2_mp.html Manuel Barbera: General Corpora and Corpus Linguistics Resources]
 +
 +
*[ftp://ftp.cs.cornell.edu/pub/smart/med/ Medlars collection]
 +
 +
*[ftp://ftp.ox.ac.uk/pub/wordlists/ Miscellaneous Word Lists from Oxford University]
 +
 +
*[http://www.lpl.univ-aix.fr/projects/multext/ Multilingual Text Tools and Corpora]
 +
 +
*[http://www.census.gov/genealogy/names Name lists from US census]
 +
 +
*[http://www.di.fc.ul.pt/~ahb/nexing.htm Nexing Corpus]
 +
 +
*[http://www.cs.cmu.edu/web/books.html On-line books at CMU]
 +
 +
*[http://logos.uio.no/opus/ OPUS -- An Open Source Parallel Corpus]
 +
 +
*[http://elex.amu.edu.pl/~przemka/PICLE_search.php Polish subcorpus of the International Corpus of Learner English]
 +
 +
*[http://www.cirp.es/WXN/wxn/frames/proxectos.html Ramon Piero Center for Research]
 +
 +
*[http://about.reuters.com/researchandstandards/corpus/ Reuters Corpus]
 +
 +
*[http://www.ldc.upenn.edu/Catalog/LDC2001S97.html Speech in Noisy Environments 1 (SPINE1 CODED) Coded Audio]
 +
 +
*[http://www.ldc.upenn.edu/Catalog/LDC2001S99.html Speech in Noisy Environments 2 (SPINE2 CODED) Coded Audio]
 +
 +
*[http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/doc/notes/corpora.txt Survey of Electronic Corpora (by Jane A.
 +
Edwards, file at CMU)]
 +
 +
*[http://www.ucl.ac.uk/english-usage/ Survey of English Usage, University College, London]
 +
 +
*[http://www.icsi.berkeley.edu/real/stp/index.html Switchboard Transcription Project]
 +
 +
*[http://www.tractor.de/ TELRI Research Archive of Computational Tools and Resources]
 +
 +
*[http://nora.hd.uib.no/index-e.html The CORPORA DataCenter (Norway)]
 +
 +
*[ftp://ftp.dcs.shef.ac.uk/share/ilash/Moby/ The Moby Corpus]
 +
 +
*[http://www.hf.uio.no/tekstlab/prosjekter/SOFIE.htm The Sofie Treebank - A Parallel Treebank of North European Languages]

Latest revision as of 03:00, 12 September 2019

Edwards, file at CMU)]