Uncategorized resources: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
Sean Bethard (talk | contribs) m Change *[http://nltk.sourceforge.net Natural Language Toolkit (NLTK)] to *[http://nltk.net Natural Language Toolkit (NLTK)], move to Multilingual resources, Collaborative projects. |
||
| (9 intermediate revisions by the same user not shown) | |||
| Line 3: | Line 3: | ||
* Please add new items to the [[List of resources by language]]. | * Please add new items to the [[List of resources by language]]. | ||
* Please help us move items below into the [[List of resources by language]]. | * Please help us move items below into the [[List of resources by language]]. | ||
* Missing items are denoted with †. | |||
</div> | </div> | ||
*[http://www. | *[http://www.siggen.org/resources/ ACL SIGGEN Resources Wiki]† | ||
*[http://www. | *[http://www.cs.kun.nl/agfl/ AFGL Parser Generator]† | ||
*[http://www. | *[http://www.a-i.com/ Artificial Intelligence NV (Ai)]† | ||
*[http://www.eprints.org/ Author/Institution Self-Archiving]† | |||
*[http://www.eprints.org/ Author/Institution Self-Archiving] | |||
*[http://www.copernic.com/ COPERNIC 2000] | *[http://www.copernic.com/ COPERNIC 2000] | ||
*[http://www.cis.upenn.edu/~dbikel/#stat-parser Dan Bikel's Parser] | *[http://www.cis.upenn.edu/~dbikel/#stat-parser Dan Bikel's Parser]† | ||
*[http://java.sun.com/docs/books/tutorial/i18n/text/boundaryintro.html Detecting Text Boundaries] | *[http://java.sun.com/docs/books/tutorial/i18n/text/boundaryintro.html Detecting Text Boundaries] | ||
*[http://www.catchword.com/era Educational Research Abstracts] | *[http://www.catchword.com/era Educational Research Abstracts]† | ||
*[http://emotion-research.net/wiki/Databases Emotional Databases] | *[http://emotion-research.net/wiki/Databases Emotional Databases]† | ||
*[http://lingo.stanford.edu/erg.html English Resource Grammar] | *[http://lingo.stanford.edu/erg.html English Resource Grammar] | ||
| Line 31: | Line 28: | ||
*[http://www.cjk.org/cjk/samples/chincomc.htm English-Chinese Chinese-English Dictionary of Computer Terms] | *[http://www.cjk.org/cjk/samples/chincomc.htm English-Chinese Chinese-English Dictionary of Computer Terms] | ||
*[http://www.freelangonline.com/ Freelangonline - many on-line dictionaries + more] | *[http://www.freelangonline.com/ Freelangonline - many on-line dictionaries + more]† | ||
*[http://dmoz.org/Computers/Software/Information_Retrieval/ Information Retrieval] | *[http://dmoz.org/Computers/Software/Information_Retrieval/ Information Retrieval]† | ||
*[http://ir.dcs.gla.ac.uk/resources.html IR resources] | *[http://ir.dcs.gla.ac.uk/resources.html IR resources] | ||
*[http://www.kwicfinder.com/KWiCFinder.html KwicFinder Web Concordancer and Online Research Tool] | *[http://www.kwicfinder.com/KWiCFinder.html KwicFinder Web Concordancer and Online Research Tool] | ||
*[http://www.academiaisla.com/acadi/gen/0_en.html LANGUAGE LINKS] | *[http://www.academiaisla.com/acadi/gen/0_en.html LANGUAGE LINKS]† | ||
*[http://www.link.cs.cmu.edu/lexfn/ Lexical FreeNet] | *[http://www.link.cs.cmu.edu/lexfn/ Lexical FreeNet]† | ||
*[http://www-lfg.stanford.edu/lfg/ilfga LFG Database: List of Names] | *[http://www-lfg.stanford.edu/lfg/ilfga LFG Database: List of Names]† | ||
*[http://lse.umiacs.umd.edu/ Linguist's Search Engine] | *[http://lse.umiacs.umd.edu/ Linguist's Search Engine]† | ||
*[http://www.ims.uni-stuttgart.de/projekte/TIGER/ Linguistic Interpretation of a German Corpus] | *[http://www.ims.uni-stuttgart.de/projekte/TIGER/ Linguistic Interpretation of a German Corpus]† | ||
*[http://www.link.cs.cmu.edu/ LINK GRAMMAR PARSER] | *[http://www.link.cs.cmu.edu/ LINK GRAMMAR PARSER]† | ||
*[http://www.eturner.net/linkgrammar-wn/ LinkGrammar-WN project] | *[http://www.eturner.net/linkgrammar-wn/ LinkGrammar-WN project]† | ||
*[http://www.psy.uwa.edu.au/MRCDataBase/uwa_mrc.htm MRC Psycholinguistic Database] | *[http://www.psy.uwa.edu.au/MRCDataBase/uwa_mrc.htm MRC Psycholinguistic Database]† | ||
*[http://nl.ijs.si/ME/V3/ Multext East Resources, Version 3] | *[http://nl.ijs.si/ME/V3/ Multext East Resources, Version 3] | ||
*[http://multiwordnet.itc.it/english/home.php MultiWordNet] | *[http://multiwordnet.itc.it/english/home.php MultiWordNet]† | ||
*[http://www.comp.nus.edu.sg/~rpnlpir/ Natural Language Processing / Information Retrieval Software Repository] | *[http://www.comp.nus.edu.sg/~rpnlpir/ Natural Language Processing / Information Retrieval Software Repository]† | ||
*[http://nlsh.sourceforge.net/ NLSH: Natural Language Shell] | *[http://nlsh.sourceforge.net/ NLSH: Natural Language Shell]† | ||
*[http://www.irisa.fr/Omphalos/ Omphalos Context-Free Language Learning Competition] | *[http://www.irisa.fr/Omphalos/ Omphalos Context-Free Language Learning Competition] | ||
*[http://ysomeya.hp.infoseek.co.jp/ Online Business Letter Corpus KWIC Concordancer] | *[http://ysomeya.hp.infoseek.co.jp/ Online Business Letter Corpus KWIC Concordancer]† | ||
*[http://www.grsampson.net/RLeafAnc.html Parse Evaluation] | *[http://www.grsampson.net/RLeafAnc.html Parse Evaluation] | ||
*[http://pie.usna.edu Phrases in English and the British National Corpus] | *[http://pie.usna.edu Phrases in English and the British National Corpus]† | ||
*[http://pygoogle.sourceforge.net/ PyGoogle: A Python Interface to the Google API] | *[http://pygoogle.sourceforge.net/ PyGoogle: A Python Interface to the Google API] | ||
*[http://www.ai.uga.edu/mc/PythonForNewbieLinguists.html Python Programming Tutorial] | *[http://www.ai.uga.edu/mc/PythonForNewbieLinguists.html Python Programming Tutorial]† | ||
*[http://corpus.leeds.ac.uk/internet.html Query to Internet Corpora] | *[http://corpus.leeds.ac.uk/internet.html Query to Internet Corpora] | ||
*[http://www.lexmasterclass.com/exercises/regex/index.html Regular Expression Exercises] | *[http://www.lexmasterclass.com/exercises/regex/index.html Regular Expression Exercises]† | ||
*[http://www.sims.berkeley.edu/~hjiang1/eng_chi_resources.html Resources for English-Chinese CLIR] | *[http://www.sims.berkeley.edu/~hjiang1/eng_chi_resources.html Resources for English-Chinese CLIR]† | ||
*[http://www.cs.technion.ac.il/~gabr/resources/resources.html Resources for Text, Speech and Language Processing] | *[http://www.cs.technion.ac.il/~gabr/resources/resources.html Resources for Text, Speech and Language Processing] | ||
| Line 87: | Line 82: | ||
*[http://www.sfu.ca/rst/ Rhetorical Structure Theory (RST)] | *[http://www.sfu.ca/rst/ Rhetorical Structure Theory (RST)] | ||
*[http://www.clres.com/SensSemRoles.html Senseval-3 Task: Automatic Labeling of Semantic Roles] | *[http://www.clres.com/SensSemRoles.html Senseval-3 Task: Automatic Labeling of Semantic Roles]† | ||
*[http://www.clres.com/SensWNDisamb.html Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses] | *[http://www.clres.com/SensWNDisamb.html Senseval-3 Task: Word-Sense Disambiguation of WordNet Glosses]† | ||
*[http://www.cs.unt.edu/~rada/wa/ Sentence Alignment and Word Alignment: Projects, Papers, Evaluation, Etc.] | *[http://www.cs.unt.edu/~rada/wa/ Sentence Alignment and Word Alignment: Projects, Papers, Evaluation, Etc.]† | ||
*[http://ontoweb-lt.dfki.de/knowledge_index.htm SIG5 OntoWeb] | *[http://ontoweb-lt.dfki.de/knowledge_index.htm SIG5 OntoWeb] | ||
*[http://sara.natcorp.ox.ac.uk/lookup.html Simple Search of BNC-World] | *[http://sara.natcorp.ox.ac.uk/lookup.html Simple Search of BNC-World]† | ||
*[http://www.sigsem.org/ Special Interest Group on Computational Semantics] | *[http://www.sigsem.org/ Special Interest Group on Computational Semantics] | ||
*[http://www.telemakus.net/ Telemakus: Mining and Mapping Research Findings to Promote Knowledge Discovery]† | |||
*[http://www.telemakus.net/ Telemakus: Mining and Mapping Research Findings to Promote Knowledge Discovery] | |||
*[http://www.clef-campaign.org/ The Cross-Language Evaluation Forum] | *[http://www.clef-campaign.org/ The Cross-Language Evaluation Forum] | ||
| Line 143: | Line 136: | ||
*[http://www.gmi.org/wlms/ World Language Mapping System] | *[http://www.gmi.org/wlms/ World Language Mapping System] | ||
*[http://www.comp.leeds.ac.uk/eric/iwcs.ps A Domain-Independent Semantic Tagger for the Study of Meaning Associations in English Text] | *[http://www.comp.leeds.ac.uk/eric/iwcs.ps A Domain-Independent Semantic Tagger for the Study of Meaning Associations in English Text]† | ||
*[http://www3.interscience.wiley.com/cgi-bin/abstract/104525215/ABSTRACT Automatic Construction of English/Chinese Parallel Corpora] | *[http://www3.interscience.wiley.com/cgi-bin/abstract/104525215/ABSTRACT Automatic Construction of English/Chinese Parallel Corpora]† | ||
*[ftp://ftp.icsi.berkeley.edu/pub/techreports/ Berkeley - Technical Reports] | *[ftp://ftp.icsi.berkeley.edu/pub/techreports/ Berkeley - Technical Reports] | ||
| Line 184: | Line 177: | ||
*[http://xlex.uni-muenster.de/ Münster Tagging Project] | *[http://xlex.uni-muenster.de/ Münster Tagging Project] | ||
*[http://perso.wanadoo.fr/rosavram/ NooJ] | *[http://perso.wanadoo.fr/rosavram/ NooJ] | ||
| Line 199: | Line 190: | ||
*[http://www40.brinkster.com/dictionarium/index.html dictionarium] | *[http://www40.brinkster.com/dictionarium/index.html dictionarium] | ||
*[http://www.mat.upm.es/~aries ARIES Natural Language Tools] | *[http://www.mat.upm.es/~aries ARIES Natural Language Tools]† | ||
*[http://www.york.ac.uk/services/library/subjects/langint.htm Language and Linguistic Science information sources] | *[http://www.york.ac.uk/services/library/subjects/langint.htm Language and Linguistic Science information sources] | ||
| Line 221: | Line 212: | ||
* [http://www.answerbus.com AnswerBus] | * [http://www.answerbus.com AnswerBus] | ||
* [http://www.axontologic.com Axontologic] | * [http://www.axontologic.com Axontologic]† | ||
* [http://www.bionlp.org Natural language processing of biology text] | * [http://www.bionlp.org Natural language processing of biology text] | ||
| Line 233: | Line 224: | ||
* [http://abu.cnam.fr/ ABU : la Biblioth\`eque Universelle] | * [http://abu.cnam.fr/ ABU : la Biblioth\`eque Universelle] | ||
* [http://www.comp.leeds.ac.uk/amalgam/tagsets/tagmenu.html AMALGAM English PoS tagsets] | * [http://www.comp.leeds.ac.uk/amalgam/tagsets/tagmenu.html AMALGAM English PoS tagsets]† | ||
* [http://www.comp.leeds.ac.uk/amalgam/amalgam/multi-tagged.html AMALGAM Multi-Tagged Corpus] | * [http://www.comp.leeds.ac.uk/amalgam/amalgam/multi-tagged.html AMALGAM Multi-Tagged Corpus]† | ||
* [http://www.comp.leeds.ac.uk/amalgam/amalgam/multi-parsed.html AMALGAM Multi-Treebank] | * [http://www.comp.leeds.ac.uk/amalgam/amalgam/multi-parsed.html AMALGAM Multi-Treebank]† | ||
* [http://www.comp.leeds.ac.uk/amalgam/amalgam/amalgtag3.html AMALGAM email Part-of-Speech tagger] | * [http://www.comp.leeds.ac.uk/amalgam/amalgam/amalgtag3.html AMALGAM email Part-of-Speech tagger]† | ||
* [http://www.lpl.univ-aix.fr/projects/arcade/ ARCADE: Evaluation of text alignment tools] | * [http://www.lpl.univ-aix.fr/projects/arcade/ ARCADE: Evaluation of text alignment tools]† | ||
* [http://www.aspseek.org/ ASPSEEK search engine] | * [http://www.aspseek.org/ ASPSEEK search engine]† | ||
* [http://www.ucc.ie/info/net/acronyms/acro.html Acronym Server] | * [http://www.ucc.ie/info/net/acronyms/acro.html Acronym Server]† | ||
* [http://tcc.itc.it/people/lavelli/ Alberto Lavelli, ITC] | * [http://tcc.itc.it/people/lavelli/ Alberto Lavelli, ITC]† | ||
* [http://www.wtsn.binghamton.edu/ANS/ American Name Society] | * [http://www.wtsn.binghamton.edu/ANS/ American Name Society]† | ||
* [http://www.mat.upm.es/~aries/description.html Aries natural language tools] | * [http://www.mat.upm.es/~aries/description.html Aries natural language tools]† | ||
* [http://linguistlist.org/~ask-ling Ask a Linguist] | * [http://linguistlist.org/~ask-ling Ask a Linguist]† | ||
* [http://www.webspawner.com/users/samplex/ Build a program immediately from data processing examples: NLP] | * [http://www.webspawner.com/users/samplex/ Build a program immediately from data processing examples: NLP] | ||
| Line 260: | Line 251: | ||
* [http://corpora.ids-mannheim.de/~cosmas COSMAS - Corpus Storage, Maintenance and Access System] | * [http://corpora.ids-mannheim.de/~cosmas COSMAS - Corpus Storage, Maintenance and Access System] | ||
* [http://www.uni-paderborn.de/fachbereich/AG/agmadh/WWW/english/scripts.html Collection of Lecture Notes, Surveys, and Papers] | * [http://www.uni-paderborn.de/fachbereich/AG/agmadh/WWW/english/scripts.html Collection of Lecture Notes, Surveys, and Papers] | ||
| Line 330: | Line 319: | ||
* [http://www.canoo.net/ Ontology-Based Modelling in the Humanities] | * [http://www.canoo.net/ Ontology-Based Modelling in the Humanities] | ||
* [http://sunsite.unc.edu/patents/intropat.html Patent Information (includes patents in IR, Acoustics...)] | * [http://sunsite.unc.edu/patents/intropat.html Patent Information (includes patents in IR, Acoustics...)] | ||
| Line 409: | Line 396: | ||
*[http://ixa.si.ehu.es/Ixa/resources/sensecorpus A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)] | *[http://ixa.si.ehu.es/Ixa/resources/sensecorpus A Web Corpus and Topic Signatures for All WordNet 1.6 Nominal Senses (v 1.0)] | ||
*[http://odur.let.rug.nl/~vannoord/trees/ Alpino Treebank] | *[http://odur.let.rug.nl/~vannoord/trees/ Alpino Treebank]† | ||
*[http://www.aot.ru/search1.html AOT] | *[http://www.aot.ru/search1.html AOT]† | ||
*[http://pioneer.chula.ac.th/~awirote/ling/corpuslst.htm Corpus Resources (Chulalongkorn University, Thailand)] | *[http://pioneer.chula.ac.th/~awirote/ling/corpuslst.htm Corpus Resources (Chulalongkorn University, Thailand)] | ||
| Line 475: | Line 462: | ||
*[http://www.tractor.de/ TELRI Research Archive of Computational Tools and Resources] | *[http://www.tractor.de/ TELRI Research Archive of Computational Tools and Resources] | ||
*[http://nora.hd.uib.no/index-e.html The CORPORA DataCenter (Norway)] | *[http://nora.hd.uib.no/index-e.html The CORPORA DataCenter (Norway)] | ||
Latest revision as of 09:00, 12 September 2019
- Proceedings of the Corpus Linguistics 2005 Workshop on Using Corpora for Natural Language Generation
- AOT†
- [http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/doc/notes/corpora.txt Survey of Electronic Corpora (by Jane A.
Edwards, file at CMU)]