Data sets for NLG

2019-04-12T14:29:30Z

Dimitra: /* Data-to-text/Concept-to-text Generation */

Downloadable NLG systems

2019-04-12T14:25:56Z

Dimitra:

The natural language generation systems listed below are available for download over the web.
If you know of a system which is not listed here, you can email siggen-board@aclweb.org, or just click on Edit in the upper left corner of this page and add the system yourself.

== ASTROGEN ==
http://www.dsv.su.se/~hercules/ASTROGEN/ASTROGEN.html

Aggregated deep and Surface naTuRal language GENerator - Prolog based system.

== CRISP ==
http://code.google.com/p/crisp-nlg/

CRISP is Alexander Koller's NLG system that tries to cast both microplanning and sentence realisation as an AI planning problem. The code is a mixture of Java and Scala, a scripting language for the Java virtual machine. CRISP comes with its own implementation of GraphPlan, but it can also output plans in PDDL (“Planning Domain Definition Language”, a successor to STRIPS) for use with other AI planners. License: LGPL.

== FUF/SURGE ==
https://www.cs.bgu.ac.il/~elhadad/install-fuf.html

FUF/SURGE is a surface realisation system, based on functional unification grammar.

== GenI ==
http://kowey.github.io/GenI

GenI is a surface realiser for (Feature-Based Lexicalised) Tree Adjoining Grammar and a flat MRS-like semantics (sans top handle and underspecification). Toy example grammars provided for English and French. Largish core grammar for French is under development (contact us for details). GPL (commercial dual licensing available upon request). Known to work under Linux and Mac OS X (potential for making it work on Windows as well). Written in Haskell. Source code available via [http://hackage.haskell.org/package/GenI hackage], [https://github.com/kowey/GenI GitHub], or [http://hub.darcs.net/kowey/GenI hub.darcs.net].

== Grammar Explorer ==
http://www.fb10.uni-bremen.de/anglistik/langpro/kpml/tutorials/Grexplorer/grexplorer.html

The Grammar Explorer provides a means of exploring large-scale systemic-functional grammars in order to see how they are
organized and what kinds of things they cover. It can be used to explore the KPML resources.
Downloadable standalone executables of the grammar explorer are available for Windows 95/98/NT.
These already include a version of the Nigel grammar of English and pre-installed examples.

== jsRealB ==

http://rali.iro.umontreal.ca/rali/?q=en/jsrealb-bilingual-text-realiser

jsReakB is a bilingual (French and English) text realiser for web programming

== KPML ==

http://www.purl.org/net/kpml

The KPML system offers a robust, mature platform for large-scale grammar engineering that is particularly oriented to multilingual grammar development and generation. It is particularly targetted at providing resources for realistic but broad-coverage generation applications, where both flexibility of expression and speed of generation are at issue—for example in online webpage generation or spoken dialogue. KPML is also used extensively in multilingual text generation research and for teaching. It is based on systemic functional linguistics.

A growing set of generation grammars are under development for a variety of languages, inlcluding English, Spanish, Dutch, Chinese, German, Czech, and more. See the
Generation Bank (http://www.fb10.uni-bremen.de/anglistik/langpro/kpml/genbank/generation-bank.html )
for current examples. The development of further languages and of extensions to existing resources are very welcome!

== LKB ==
http://wiki.delph-in.net/moin/LkbTop

LKB (Linguistic Knowledge Builder) is a grammar engineering environment for unification-based formalisms, typically HPSG.
It includes a [http://wiki.delph-in.net/moin/LkbGeneration realiser] that takes as input Minimal Recursion Semantics (MRS). LKB is implemented in Common Lisp, and is freely available under an open source license. It includes also a KNOPPIX-based GNU/Linux live-CD, with all the system installed, ready to use.

== Multimodal Unification Grammar ==
http://www.david-reitter.com/compling/mug/

MUG Workbench is a development and debugging tool for Multimodal NLG. The grammar formalism supported is
Multimodal Functional Unification Grammar (MUG). The MUG system runs MUG grammars with fixed (test cases)
and arbitrary input specifications to produce output in a natural language, graphical user interface and
possibly in other modes. It is designed to do three things:
- Multimodal Fission (distributing output to interaction/communication modes)
- Some sentence planning (chosing information to include in the utterance)
- Natural Language and graphical user interface realization (producing some form of output)
The MUG system does these three jobs in parallel. MUG Workbench can serve to inspect the data-structures
used during generation. It should help you to learn more about the nature of unification grammars used
for parsing or natural language generation. Furthermore, the MUG Workbench is helpful in debugging your grammars.

== NaturalOWL ==
http://www.aueb.gr/users/ion/software/NaturalOWL1.1.tar.gz NaturalOWL (version 1.1)

Generates descriptions of entities and classes from OWL ontologies that have been annotated with linguistic and user modeling resources expressed in RDF. Currently supports English and Greek. Extensions for other languages welcome. NaturalOWL can also be used as a [http://protege.stanford.edu/ Protégé] plug-in. See [http://www.aueb.gr/users/ion/publications.html here] for publications describing NaturalOWL. (GPL)

== NLGen and NLGen2 ==
https://launchpad.net/nlgen

https://launchpad.net/nlgen2

The NLGen natural language generation system applies the [http://www.opencog.org/wiki/SegSim SegSim strategy] for generating English sentences. Probabilistic inference for sentence construction is based on a statistical analysis of [http://opencog.org/wiki/RelEx RelEx] output. Java, Apache license. See demo: [http://novamente.net/example/nlp.html Demo of AI Virtual Pet Answering Simple Questions].

NLGen2 uses [http://opencog.org/wiki/RelEx RelEx] dependency parses, together with [http://www.abisource.com/projects/link-grammar/ Link Grammar] linkage analysis to generate English-language output. Java, Apache license. Reference: Blake Lemoine, "[http://www.louisiana.edu/~bal2277/NLGen2.doc NLGen2: A Linguistically Plausible, General Purpose Natural Language Generation System]".

== OpenCCG ==
http://openccg.sourceforge.net/

OpenCCG is both a parser and a realizer for [[Combinatory Categorial Grammar]]. It has been used in several dialog systems. The realizer has been enhanced with n-gram models and a supertagging approach called hypertagging. OpenCCG is implemented in Java, and is freely available under the LGPL.

== rLDCP: Text Generation from Data ==
https://cran.r-project.org/web/packages/rLDCP/index.html

R package for text generation from data

== SimpleNLG ==

https://github.com/simplenlg/simplenlg (English)

http://www-etud.iro.umontreal.ca/~vaudrypl/snlgbil/snlgEnFr_english.html (French)

https://github.com/citiususc/SimpleNLG-GL (Galician)

https://github.com/citiususc/SimpleNLG-ES (Spanish)

SipleNLG is a simple Java-based realiser. Its grammatical coverage and syntactic knowledge is small compared to KPML or FUF/SURGE. However, because it is so simple, its relatively
easy for people to learn how to use it. It has a Java API, and can be used from other languages via an XML interface. There are "unofficial" ports to other programming languages such as Python and Ruby. Versions for other human languages are being worked on, including [https://aclweb.org/anthology/W18-6508 Dutch], [https://github.com/alexmazzei/SimpleNLG-IT Italian], [https://aclweb.org/anthology/papers/W/W18/W18-6506/ Mandarin]

== SPUD ==
http://www.cs.rutgers.edu/~mdstone/nlg.html

SPUD (Sentence Planner Using Descriptions) is Matthew Purver's LTAG-based NLG system. There are two versions: SPUD version 0.01 was written in SML. Later versions, known as SPUD lite, are written in Prolog. The small codebase of SPUD lite makes it ideal for teaching, but it is also used in dialog system prototypes.

== STANDUP ==
https://www.abdn.ac.uk/ncs/departments/computing-science/standup-315.php

STANDUP (System To Augment Non-speakers' Dialogue Using Puns) is a collaborative project on generating simple jokes from a graphical user interface appropriate for non-speaking children. The project began in October 2003 and ran until March 2007. The software was written in Java and is available for Windows and Linux, including source code and database files.

== Suregen-2 ==
http://www.suregen.de/00023.html

Suregen is “a hybrid, multilingual (German, English) ontology based and NLG-oriented formalism for generating text for documents in clinical medicine.”
The system Suregen-2 is written in (Allegro) Common Lisp. A [http://www.suregen.de/ftp/standalone1.zip demo system] which runs under Windows is available for download. A [http://www.suregen.de/ftp/selfrunningdemo.zip screencast video] shows data being entered into computer forms using mouse and keyboard while a feedback text is continually updated and shown below. (Try playing the AVI file in [http://www.videolan.org/vlc/ VLC] if you run into problems.) Perhaps this system could be considered an instance of the [http://en.wikipedia.org/wiki/WYSIWYM_(Meant) WYSIWYM] approach.

[[Category:Software]]
{{SIGGEN Wiki}}

Data sets for NLG

2019-04-12T14:20:27Z

Dimitra:

2019-01-28T14:02:50Z

Dimitra: /* Board */

__NOTOC__

<h1>ACL Special Interest Group on Natural Language Generation </h1>

{|
|-
|[[File:Siggen_logo_small.JPG|left]]||<h4 style="width:95%;margin:0;background-color:#cedff2;font-size:120%;font-weight:bold;border:1px solid #a3b0bf;text-align:justify;color:#000;padding:0.2em 0.4em;">Welcome to the home page of the Association for Computational Linguistics Special Interest Group on Natural Language Generation. SIGGEN [ˈsɪɡ.ʤɛn] is a special interest group of the Association for Computational Linguistics (ACL). It provides a forum for the discussion, dissemination and archiving of research topics and results in the field of text generation. </h4>

|}

Active topics of interest include:

*Discourse models, content planning.
*Syntactic realization: formalisms and models of grammars for sentence production.
*Architecture of generators.
*Lexical choice.
*Psychological modelling of discourse production.
*Pragmatic influences on lexical choice, syntax and content selection.
*Multilingual or multi-modal generation.
*Applications of generation technology (report generation, explanation for knowledge-based systems, automatic translation...).
*Learning methods.
*Evaluation of generation results.

Relevant aspects of the following areas relate to problems of natural language generation:

*Grammar theory
*Statistical methods
*Speech synthesis
*Psycholinguistics
*Neuroscience
*Philosophy

== Upcoming Events ==

INLG 2019 will be announced soon!

== Recent Events ==

[https://inlg2018.uvt.nl/ INLG 2018]

Tilburg, Netherlands, 5-8 Novemeber 2018

== Mailing List ==
=== Joining the mailing list: ===

:The SIGGEN mailing list is currently going through a transition.
:To sign up, view preferences, change preferences, or unsubscribe, go to:

::'''[http://www.jiscmail.ac.uk/SIGGEN http://www.jiscmail.ac.uk/SIGGEN]'''

:If there are any issues, e-mail: <u>'''siggen-webmaster (ta) aclweb (dot) org'''</u>.

=== Posting messages to the mailing list ===

:Please join the mailing list first (see above). Then you may use the email alias <u>'''siggen-list (ta) aclweb (dot) org'''</u> to post e-mails to the list.

== Board ==
The SIGGEN board is made up of the following people:

*[https://ehudreiter.com/Ehud Reiter] ([mailto:e.reiter@abdn.ac.uk mail]) Professor/Chair in Computer Science at [https://www.abdn.ac.uk/ncs/profiles/e.reiter/] University of Aberdeen. [mailto:siggen-chair(ta)aclweb(dot)org chair])
:elected in December 2018 for the period from 1st January 2019 to 31st December 2022
*[https://dimitragkatzia.wordpress.com Dimitra Gkatzia] ([mailto:d.gkatzia@napier.ac.uk mail]) [http://www.napier.ac.uk/about-us/our-schools/school-of-computing/staff School of Computing, Edinburgh Napier University], Edinburgh.
:elected in December 2016 for the period from 1st January 2017 to 31st December 2020
*[http://amandastent.com// Amanda Stent] ([mailto:amanda.stent@gmail.com mail]), Bloomberg LP ([mailto:siggen-treasurer(ta)aclweb(dot)org treasurer])
:elected in December 2016 for the period from 1st January 2017 to 31st December 2020
*[https://citius.usc.es/equipo/investigadores-postdoutorais/jose-maria-alonso-moral Jose M. Alons] ([mailto:josemaria.alonso.moral@usc.es]) [ University of Santiago de Compostela, Spain] (secretary)
:elected in December 2018 for the period from 1st January 2019 to 31st December 2022
*[http://homepages.inf.ed.ac.uk/amyi Amy Isard] ([mailto:amy.isard@ed.ac.uk mail]) [http://www.inf.ed.ac.uk School of Informatics, University of Edinburgh] (student member)
:elected in December 2018 for the period from 1st January 2019 to 31st December 2020

To contact the entire board, please use the email alias: <u>'''siggen-board (ta) aclweb (dot) org'''</u>.

For questions regarding this website, please email: <u>'''siggen-webmaster (ta) aclweb (dot) org'''</u>.

== [http://www.aclweb.org/anthology/siggen.html Workshop Proceedings ] ==

== [[SIGGEN: Archive|Archive]] ==
== [[SIGGEN: Newsletter Archive|Newsletter Archive]] ==
== [[SIGGEN: Constitution|Constitution]] ==
== [[SIGGEN: Who's Who in NLG|Who's Who in NLG]] ==
== [[SIGGEN: What's Where in NLG|What's Where in NLG]] ==

== Resources ==
[[Natural_Language_Generation_Portal|Natural Language Generation Portal]]

SIGGEN

2019-01-28T13:57:05Z

Dimitra: /* Board */

__NOTOC__

<h1>ACL Special Interest Group on Natural Language Generation </h1>

{|
|-
|[[File:Siggen_logo_small.JPG|left]]||<h4 style="width:95%;margin:0;background-color:#cedff2;font-size:120%;font-weight:bold;border:1px solid #a3b0bf;text-align:justify;color:#000;padding:0.2em 0.4em;">Welcome to the home page of the Association for Computational Linguistics Special Interest Group on Natural Language Generation. SIGGEN [ˈsɪɡ.ʤɛn] is a special interest group of the Association for Computational Linguistics (ACL). It provides a forum for the discussion, dissemination and archiving of research topics and results in the field of text generation. </h4>

|}

Active topics of interest include:

*Discourse models, content planning.
*Syntactic realization: formalisms and models of grammars for sentence production.
*Architecture of generators.
*Lexical choice.
*Psychological modelling of discourse production.
*Pragmatic influences on lexical choice, syntax and content selection.
*Multilingual or multi-modal generation.
*Applications of generation technology (report generation, explanation for knowledge-based systems, automatic translation...).
*Learning methods.
*Evaluation of generation results.

Relevant aspects of the following areas relate to problems of natural language generation:

*Grammar theory
*Statistical methods
*Speech synthesis
*Psycholinguistics
*Neuroscience
*Philosophy

== Upcoming Events ==

INLG 2019 will be announced soon!

== Recent Events ==

[https://inlg2018.uvt.nl/ INLG 2018]

Tilburg, Netherlands, 5-8 Novemeber 2018

== Mailing List ==
=== Joining the mailing list: ===

:The SIGGEN mailing list is currently going through a transition.
:To sign up, view preferences, change preferences, or unsubscribe, go to:

::'''[http://www.jiscmail.ac.uk/SIGGEN http://www.jiscmail.ac.uk/SIGGEN]'''

:If there are any issues, e-mail: <u>'''siggen-webmaster (ta) aclweb (dot) org'''</u>.

=== Posting messages to the mailing list ===

:Please join the mailing list first (see above). Then you may use the email alias <u>'''siggen-list (ta) aclweb (dot) org'''</u> to post e-mails to the list.

== Board ==
The SIGGEN board is made up of the following people:

*[https://ehudreiter.com/Ehud Reiter] ([mailto:e.reiter@abdn.ac.uk mail]) Professor/Chair in Computer Science at [https://www.abdn.ac.uk/ncs/profiles/e.reiter/] University of Aberdeen. [mailto:siggen-chair(ta)aclweb(dot)org chair])
:elected in December 2018 for the period from 1st January 2019 to 31st December 2022
*[https://dimitragkatzia.wordpress.com Dimitra Gkatzia] ([mailto:d.gkatzia@napier.ac.uk mail]) [http://www.napier.ac.uk/about-us/our-schools/school-of-computing/staff School of Computing, Edinburgh Napier University], Edinburgh.
:elected in December 2016 for the period from 1st January 2017 to 31st December 2020
*[http://amandastent.com// Amanda Stent] ([mailto:amanda.stent@gmail.com mail]), Bloomberg LP ([mailto:siggen-treasurer(ta)aclweb(dot)org treasurer])
:elected in December 2016 for the period from 1st January 2017 to 31st December 2020
*[https://dimitragkatzia.wordpress.com Dimitra Gkatzia] ([mailto:d.gkatzia@napier.ac.uk mail]) [http://www.napier.ac.uk/about-us/our-schools/school-of-computing/staff School of Computing, Edinburgh Napier University], Edinburgh (secretary)
:elected in December 2018 for the period from 1st January 2019 to 31st December 2022
*[http://homepages.inf.ed.ac.uk/amyi Amy Isard] ([mailto:amy.isard@ed.ac.uk mail]) [http://www.inf.ed.ac.uk School of Informatics, University of Edinburgh] (student member)
:elected in December 2018 for the period from 1st January 2019 to 31st December 2020

To contact the entire board, please use the email alias: <u>'''siggen-board (ta) aclweb (dot) org'''</u>.

For questions regarding this website, please email: <u>'''siggen-webmaster (ta) aclweb (dot) org'''</u>.

== [http://www.aclweb.org/anthology/siggen.html Workshop Proceedings ] ==

== [[SIGGEN: Archive|Archive]] ==
== [[SIGGEN: Newsletter Archive|Newsletter Archive]] ==
== [[SIGGEN: Constitution|Constitution]] ==
== [[SIGGEN: Who's Who in NLG|Who's Who in NLG]] ==
== [[SIGGEN: What's Where in NLG|What's Where in NLG]] ==

== Resources ==
[[Natural_Language_Generation_Portal|Natural Language Generation Portal]]