<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/adminwiki/index.php?action=history&amp;feed=atom&amp;title=2008Q1_Reports%3A_Anthology</id>
	<title>2008Q1 Reports: Anthology - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/adminwiki/index.php?action=history&amp;feed=atom&amp;title=2008Q1_Reports%3A_Anthology"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/adminwiki/index.php?title=2008Q1_Reports:_Anthology&amp;action=history"/>
	<updated>2026-06-01T05:04:55Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/adminwiki/index.php?title=2008Q1_Reports:_Anthology&amp;diff=238&amp;oldid=prev</id>
		<title>StevenBird: 2008 Winter Reports: Anthology moved to 2008Q1 Reports: Anthology</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/adminwiki/index.php?title=2008Q1_Reports:_Anthology&amp;diff=238&amp;oldid=prev"/>
		<updated>2008-11-25T22:58:51Z</updated>

		<summary type="html">&lt;p&gt;&lt;a href=&quot;/adminwiki/index.php?title=2008_Winter_Reports:_Anthology&quot; class=&quot;mw-redirect&quot; title=&quot;2008 Winter Reports: Anthology&quot;&gt;2008 Winter Reports: Anthology&lt;/a&gt; moved to &lt;a href=&quot;/adminwiki/index.php?title=2008Q1_Reports:_Anthology&quot; title=&quot;2008Q1 Reports: Anthology&quot;&gt;2008Q1 Reports: Anthology&lt;/a&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 22:58, 25 November 2008&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;en&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(No difference)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>StevenBird</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/adminwiki/index.php?title=2008Q1_Reports:_Anthology&amp;diff=94&amp;oldid=prev</id>
		<title>StevenBird: New page: &lt;pre&gt; ACL ANTHOLOGY Report, January 2008 Steven Bird &amp; Min Yen Kan  The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, a...</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/adminwiki/index.php?title=2008Q1_Reports:_Anthology&amp;diff=94&amp;oldid=prev"/>
		<updated>2008-11-25T01:28:12Z</updated>

		<summary type="html">&lt;p&gt;New page: &amp;lt;pre&amp;gt; ACL ANTHOLOGY Report, January 2008 Steven Bird &amp;amp; Min Yen Kan  The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, a...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&amp;lt;pre&amp;gt;&lt;br /&gt;
ACL ANTHOLOGY Report, January 2008&lt;br /&gt;
Steven Bird &amp;amp; Min Yen Kan&lt;br /&gt;
&lt;br /&gt;
The ACL Anthology is a digital archive of research papers in&lt;br /&gt;
computational linguistics, sponsored by the CL community, and freely&lt;br /&gt;
available to all.  It includes the Computational Linguistics journal,&lt;br /&gt;
and proceedings of many conferences and workshops including: ACL,&lt;br /&gt;
EACL, NAACL, ANLP, TINLAP, COLING, HLT, MUC, and Tipster.  Conference&lt;br /&gt;
proceedings are published in the anthology around the same time as the&lt;br /&gt;
conference.  CL articles are published in the anthology one year in&lt;br /&gt;
arrears (but individual subscribers can access recent issues&lt;br /&gt;
electronically via the MIT Press website).&lt;br /&gt;
&lt;br /&gt;
The anthology now contains 14,000 papers (up from 12,500 papers twelve&lt;br /&gt;
months ago), along with full-text search.  The materials are now&lt;br /&gt;
hosted on the ACL website, at http://aclweb.org/anthology-index/,&lt;br /&gt;
thanks to Drago Radev.  Most of the papers are also indexed by&lt;br /&gt;
Citeseer and Google Scholar, helping the citation counts of ACL&lt;br /&gt;
authors.  The ACM Digital Library creates full metadata for all&lt;br /&gt;
anthology materials and registers digital object identifiers for ACL&lt;br /&gt;
papers (e.g. http://dx.doi.org/10.3115/1118693.1118695), costing the&lt;br /&gt;
ACL $275 annually.  The new AAN ACL Anthology Network website at&lt;br /&gt;
Michigan provides detailed citation analysis for the anthology.&lt;br /&gt;
Updates to the anthology are announced on the mailing list at&lt;br /&gt;
http://groups.google.com/group/acl-anthology&lt;br /&gt;
&lt;br /&gt;
Steven Bird has now stepped down as editor, and has passed on the role&lt;br /&gt;
to Min-Yen Kan.  This transition marks the conclusion of the&lt;br /&gt;
development phase of the Anthology: (a) materials from the ACL&amp;#039;s&lt;br /&gt;
hardcopy and microfiche eras are now all digitized; (b) born-digital&lt;br /&gt;
materials published in ad hoc formats have been manually converted;&lt;br /&gt;
(c) the anthology has been incorporated into the ACL&amp;#039;s operation,&lt;br /&gt;
including the publications process and web hosting.  The ongoing&lt;br /&gt;
maintenance of the anthology involves several challenges: streamlining&lt;br /&gt;
the proceedings upload process; incorporating richer bibliographic&lt;br /&gt;
metadata as it becomes available via DOI services, and supporting&lt;br /&gt;
community initiatives that build on the Anthology.&lt;br /&gt;
&lt;br /&gt;
ONGOING ACTIVITIES&lt;br /&gt;
&lt;br /&gt;
PACLIC PROCEEDINGS: The steering committee of PACLIC -- the Pacific&lt;br /&gt;
Asia Conference on Language, Information and Computation -- has&lt;br /&gt;
approached the Anthology editor to request that PACLIC proceedings be&lt;br /&gt;
included in the Anthology.  This has been an important regional&lt;br /&gt;
conference covering language in the Pacific Asian region over the past&lt;br /&gt;
twenty years.  Recently, with great help from Professor Harada&amp;#039;s team&lt;br /&gt;
at Waseda University, all PACLIC proceedings have been digitized, and&lt;br /&gt;
posted at http://www.decode.waseda.ac.jp/PACLIC-STEERING/.  Including&lt;br /&gt;
these materials would add to the geographical and linguistic diversity&lt;br /&gt;
of the Anthology.  The Executive needs to establish the scope of the&lt;br /&gt;
Anthology beyond the ACL&amp;#039;s own publications.&lt;br /&gt;
&lt;br /&gt;
IJCNLP PROCEEDINGS: The 2005 proceedings were excluded from the ACL&lt;br /&gt;
Anthology because of an agreement with Springer.  Once the required&lt;br /&gt;
three year period elapses, during 2008, the IJCNLP-05 proceedings can&lt;br /&gt;
be incorporated into the Anthology.  Su Jian is the contact person for&lt;br /&gt;
organizing this.  IJCNLP-08 proceedings will also be processed into&lt;br /&gt;
the anthology at a later date this year, pending the final list of&lt;br /&gt;
archived papers from the IJCNLP conference chairs.&lt;br /&gt;
&lt;br /&gt;
HIGHER-QUALITY BIBLIOGRAPHIC METADATA: The ACM Digital Library is&lt;br /&gt;
creating high-quality bibliographic metadata for each individual&lt;br /&gt;
paper, in conjunction with registering each paper with a DOI.  It&lt;br /&gt;
should be possible to extract that metadata and improve the quality of&lt;br /&gt;
metadata on the Anthology site (e.g. removing OCR errors in the&lt;br /&gt;
spelling of author and paper names).&lt;br /&gt;
&lt;br /&gt;
PUBLICATION INSTRUCTIONS: The instructions for the publication&lt;br /&gt;
software need to be updated to cover two further tasks: (i) obtaining&lt;br /&gt;
the workshop identifiers from the Anthology editor, and (ii) uploading&lt;br /&gt;
the materials to the anthology by FTP.  Conferences and workshops not&lt;br /&gt;
held in conjunction with a regular ACL meeting are not automatically&lt;br /&gt;
included in the Anthology.  Organizers of such events shound consider&lt;br /&gt;
using the ACL publication software and contacting the Anthology editor&lt;br /&gt;
to ensure timely incorporation of the proceedings in the Anthology.&lt;br /&gt;
&lt;br /&gt;
SIG RELATED MATERIALS: Min is now working on expanding the scope of&lt;br /&gt;
Anthology materials where feasible.  In particular, SIGs are likely to&lt;br /&gt;
have their own specialized Anthology pages, featuring links to&lt;br /&gt;
materials of relevance or supported by each SIG.  Once this is done,&lt;br /&gt;
we hope to expand the archiving of materials to workshops/conference&lt;br /&gt;
related to SIGs.&lt;br /&gt;
&lt;br /&gt;
TIMING: Conference and workshop organizers have a variety of opinions&lt;br /&gt;
about exactly when proceedings should appear in the Anthology&lt;br /&gt;
(e.g. before, during, or after the event).  We recommend that the&lt;br /&gt;
Executive establish a standard practice here.&lt;br /&gt;
&lt;br /&gt;
ACM DL: Our ACM Digital Library contact, Bernard Rous, has asked to&lt;br /&gt;
receive CD-ROMs of ACL conferences as they are published, so that he&lt;br /&gt;
can initiate the process of assigning DOIs.  His address is:&lt;br /&gt;
Bernard Rous, Electronic Publishing Program Director,&lt;br /&gt;
ACM, 2 Penn Plaza Suite 701, New York NY 10121-0701&lt;br /&gt;
&lt;br /&gt;
TEXT EXTRACTION: There is an ongoing initiative to extract plain text&lt;br /&gt;
from the ACL Anthology materials, involving Dragomir Radev, Min-Yen&lt;br /&gt;
Kan and others.  Most of the Anthology has been converted, and can be&lt;br /&gt;
found at http://wing.comp.nus.edu.sg/~min/dAnth/acl/.  This will&lt;br /&gt;
facilitate the application of NLP techniques to our own publications.&lt;br /&gt;
In particular, the Linked anthology proposal submitted to the ACL&lt;br /&gt;
Exec grassroots initiative plans to create standardized test corpus&lt;br /&gt;
for future bibliographic and bibliometric studies, which we expect to&lt;br /&gt;
be reported later this year.&lt;br /&gt;
&lt;br /&gt;
TOPICAL INDEXING: The existence of persistent URLs makes it easy for&lt;br /&gt;
individuals and special interest groups to set up annotated&lt;br /&gt;
bibliographies with pointers to papers in the anthology.  Moreover,&lt;br /&gt;
the community&amp;#039;s own text categorization techniques ought to be applied&lt;br /&gt;
to its own text collection.  The anthology site should link to any&lt;br /&gt;
well-curated, comprehensive categorizations of its content, so that&lt;br /&gt;
members of the CL community can benefit from them.  The new ACL Wiki&lt;br /&gt;
would be a convenient place for members to maintain topical indexes of&lt;br /&gt;
ACL papers.&lt;br /&gt;
&lt;br /&gt;
WIKIFIED EDITING: On a more long-term schedule for late this year is&lt;br /&gt;
to have the Anthology incorporate edits from the user community. These&lt;br /&gt;
edits to metadata would be reviewed by the Anthology editor but such&lt;br /&gt;
feedback would be made much easier from the context of the users&lt;br /&gt;
themselves.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>StevenBird</name></author>
	</entry>
</feed>