Difference between revisions of "Resources for Coptic"

From ACL Wiki
(Added Coptic resources page)
 
(More Coptic resources)
Line 4:
  ==Tools==
- * An online NLP pipeline interface and API are available at http://corpling.uis.georgetown.edu/coptic-nlp
+ * An online NLP pipeline interface and API are available at http://corpling.uis.georgetown.edu/coptic-nlp . Also includes a dependency parser.
- * Individual tools:
+ * Individual command line tools used in the pipeline:
- ** [https://github.com/CopticScriptorium/tokenizers Tokenizer] - for UTF-8 plain text or XML input
+ ** [https://github.com/CopticScriptorium/tokenizers Tokenizer] - for UTF-8 plain text or XML input, also performs morphological analysis of tokens into constituent morphemes
  ** [https://github.com/CopticScriptorium/tagger-part-of-speech Tagging and lemmatization] - models are available for use with [http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/ TreeTagger]
+ ** [https://github.com/CopticScriptorium/lexical-taggers Language of origin tagger] - useful for detecting Greek and other loanwords
+ ** [https://github.com/CopticScriptorium/normalizer Automatic normalization]
+
+ * Coreference resolution and NER for Coptic have been implemented in [http://corpling.uis.georgetown.edu/xrenner xrenner]
+

  [[Category:Resources by language|Coptic]]

Revision as of 07:31, 10 June 2016

Corpora

Tools

  • Coreference resolution and NER for Coptic have been implemented in xrenner