Difference between revisions of "Resources for Coptic"

From ACL Wiki
(Added Coptic resources page)
 
(URLs)
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
 
==Corpora==
 
* Multiple corpora from the project Coptic Scriptorium are available for download from http://copticscriptorium.org under a CC-BY license
+
* Multiple corpora from the project Coptic Scriptorium are available for download from https://copticscriptorium.org under a CC-BY license
* The Coptic Universal Dependency Treebank is available at http://corpling.uis.georgetown.edu/coptic-treebank
+
* The Coptic Universal Dependency Treebank is available at https://copticscriptorium.org/treebank.html
  
 
==Tools==
 
* An online NLP pipeline interface and API are available at http://corpling.uis.georgetown.edu/coptic-nlp
+
* An online NLP pipeline interface and API, which also include a dependency parser, are available at https://tools.copticscriptorium.org/coptic-nlp/
* Individual tools:
+
* Individual command line tools used in the pipeline:
** [https://github.com/CopticScriptorium/tokenizers Tokenizer] - for UTF-8 plain text or XML input
+
** [https://github.com/CopticScriptorium/tokenizers Tokenizer] - for UTF-8 plain text or XML input, also performs morphological analysis of tokens into constituent morphemes
 
** [https://github.com/CopticScriptorium/tagger-part-of-speech Tagging and lemmatization] - models are available for use with [http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/ TreeTagger]
 
 +
** [https://github.com/CopticScriptorium/lexical-taggers Language of origin tagger] - useful for detecting Greek and other loanwords
 +
** [https://github.com/CopticScriptorium/normalizer Automatic normalization]
 +
 +
* Coreference resolution and NER for Coptic have been implemented in [https://gucorpling.org/xrenner xrenner]
 +
  
 
[[Category:Resources by language|Coptic]]
 

Latest revision as of 10:04, 16 September 2022
