<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.aclweb.org/adminwiki/index.php?action=history&amp;feed=atom&amp;title=2008Q3_Reports%3A_Anthology</id>
	<title>2008Q3 Reports: Anthology - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.aclweb.org/adminwiki/index.php?action=history&amp;feed=atom&amp;title=2008Q3_Reports%3A_Anthology"/>
	<link rel="alternate" type="text/html" href="https://www.aclweb.org/adminwiki/index.php?title=2008Q3_Reports:_Anthology&amp;action=history"/>
	<updated>2026-05-25T13:06:30Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.6</generator>
	<entry>
		<id>https://www.aclweb.org/adminwiki/index.php?title=2008Q3_Reports:_Anthology&amp;diff=180&amp;oldid=prev</id>
		<title>StevenBird: 2008 Summer Reports: Anthology moved to 2008Q3 Reports: Anthology</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/adminwiki/index.php?title=2008Q3_Reports:_Anthology&amp;diff=180&amp;oldid=prev"/>
		<updated>2008-11-25T22:51:37Z</updated>

		<summary type="html">&lt;p&gt;&lt;a href=&quot;/adminwiki/index.php?title=2008_Summer_Reports:_Anthology&quot; class=&quot;mw-redirect&quot; title=&quot;2008 Summer Reports: Anthology&quot;&gt;2008 Summer Reports: Anthology&lt;/a&gt; moved to &lt;a href=&quot;/adminwiki/index.php?title=2008Q3_Reports:_Anthology&quot; title=&quot;2008Q3 Reports: Anthology&quot;&gt;2008Q3 Reports: Anthology&lt;/a&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 22:51, 25 November 2008&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;en&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(No difference)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>StevenBird</name></author>
	</entry>
	<entry>
		<id>https://www.aclweb.org/adminwiki/index.php?title=2008Q3_Reports:_Anthology&amp;diff=132&amp;oldid=prev</id>
		<title>StevenBird: New page: ACL ANTHOLOGY Report, May 2008 Min-Yen Kan  The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, and freely available to a...</title>
		<link rel="alternate" type="text/html" href="https://www.aclweb.org/adminwiki/index.php?title=2008Q3_Reports:_Anthology&amp;diff=132&amp;oldid=prev"/>
		<updated>2008-11-25T02:03:20Z</updated>

		<summary type="html">&lt;p&gt;New page: ACL ANTHOLOGY Report, May 2008 Min-Yen Kan  The ACL Anthology is a digital archive of research papers in computational linguistics, sponsored by the CL community, and freely available to a...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;ACL ANTHOLOGY Report, May 2008&lt;br /&gt;
Min-Yen Kan&lt;br /&gt;
&lt;br /&gt;
The ACL Anthology is a digital archive of research papers in computational&lt;br /&gt;
linguistics, sponsored by the CL community, and freely available to all. &lt;br /&gt;
It includes the Computational Linguistics journal, and proceedings of many&lt;br /&gt;
conferences and workshops including: ACL, EACL, NAACL, ANLP, TINLAP, COLING,&lt;br /&gt;
HLT, MUC, and Tipster.  Conference proceedings are published in the anthology&lt;br /&gt;
around the same time as the conference.  CL articles are published in the&lt;br /&gt;
anthology roughly one year in arrears (but individual subscribers can access&lt;br /&gt;
recent issues electronically via the MIT Press website).&lt;br /&gt;
&lt;br /&gt;
The anthology now contains over 13,600 papers (up from 12,500 papers twelve&lt;br /&gt;
months ago), along with full-text search (provided by Google&amp;#039;s Custom Search&lt;br /&gt;
API).  Most of the papers are also indexed by Citeseer and Google Scholar,&lt;br /&gt;
helping the citation counts of ACL authors.  e.g. the following Google Scholar&lt;br /&gt;
search reported nearly 8,000 results:&lt;br /&gt;
http://scholar.google.com/scholar?q=site%3Aacl.ldc.upenn.edu.  The ACM Digital&lt;br /&gt;
Library is creating rich metadata and doing full citation linking for all&lt;br /&gt;
anthology materials.&lt;br /&gt;
&lt;br /&gt;
CHANGES IN EDITORS: Steven Bird stepped down after six years of service in&lt;br /&gt;
starting the initial anthology project and tracking down, acquiring, converting&lt;br /&gt;
and correcting almost all of the ACL&amp;#039;s backdated publications and ingesting&lt;br /&gt;
them into the Anthology.  Min-Yen Kan was appointed to take over Steven&amp;#039;s role&lt;br /&gt;
and assumed editorship in January 2008.&lt;br /&gt;
&lt;br /&gt;
ADDITIONS OVER LAST 12 MONTHS: I has been ingesting missing materials from the&lt;br /&gt;
Anthology including EMNLP 01; EMNLP 04, EACL 03 Workshops, MUC 98, SIGdial &amp;#039;03,&lt;br /&gt;
SIGdial &amp;#039;04.  With these additions we have an almost complete archive of&lt;br /&gt;
related ACL materials up to the recent present.  I have also linked to the MT&lt;br /&gt;
Archives that houses back issues of Mechanical Translation and Computational&lt;br /&gt;
Linguistics.  Coupled together, we have a digital archive of CL related&lt;br /&gt;
materials from 1954-2006.  Constant, smaller additions of current materials is&lt;br /&gt;
likely to be the focus now.  Towards this goal, I have also updated the&lt;br /&gt;
Anthology with recent materials from: CL Vol 32 (&amp;#039;06); Euro Workshop on NLG 07.&lt;br /&gt;
&lt;br /&gt;
MAILING LIST: The Anthology mailing list&amp;#039;s&lt;br /&gt;
(http://groups.google.com/group/acl-anthology) membership pool has grown, now&lt;br /&gt;
consisting of 94 members.  This is an annoucement-only list. &lt;br /&gt;
&lt;br /&gt;
HOSTING: The Anthology is now hosted on ACL&amp;#039;s own website.  The LDC website is&lt;br /&gt;
no longer authoritative.  A web redirect has been set up to re-route traffic&lt;br /&gt;
appropriately&lt;br /&gt;
&lt;br /&gt;
FACELIFT: A change in the HTML code of the Anthology was done in February,&lt;br /&gt;
after piloting with the mailing list group&amp;#039;s members in January.  The facelift&lt;br /&gt;
was done to simplify the HTML code for maintainence, and to factor stylistic&lt;br /&gt;
rendering from the HTML code into an Anthology-wide stylesheet.&lt;br /&gt;
&lt;br /&gt;
SIG PAGES: Each SIG now contributes its own Anthology page.  These are to be&lt;br /&gt;
maintained by each SIG exec committee though a configuration file (in YAML&lt;br /&gt;
format).  SIGs will send updates to these configuration files to me for editing&lt;br /&gt;
and insertion into the live website.  Note that this is an interim measure --&lt;br /&gt;
see ONGOING ACTIVITIES&lt;br /&gt;
&lt;br /&gt;
FUTURE MATERIALS: Aside from regular ACL meetings, currently, IJCNLP 05 is&lt;br /&gt;
scheduled to be ingested in December when the window for exclusive copyright&lt;br /&gt;
expires with Springer.  IJCNLP 08 is also in the process of being ingested.&lt;br /&gt;
&lt;br /&gt;
DIGITAL OBJECT IDENTIFIERS: DOIs are akin to ISBN numbers, but apply to&lt;br /&gt;
individual papers.  They are now the standard way to uniquely identify an&lt;br /&gt;
academic paper, and web services will be available for resolving DOIs to papers&lt;br /&gt;
(e.g. http://dx.doi.org/).  ACM helps us in assigning DOIs to published ACL&lt;br /&gt;
materials.  I&amp;#039;m working with them to make DOI assignment more timely and&lt;br /&gt;
investigating whether we can have DOIs assigned to papers as they are published&lt;br /&gt;
(so that each paper&amp;#039;s copyright notice may be able to print its own DOI).&lt;br /&gt;
&lt;br /&gt;
PUBLICATION INSTRUCTIONS: I am proactively attempting to contact each ACL&lt;br /&gt;
event&amp;#039;s publication chair to ensure that they know the process to have their&lt;br /&gt;
proceedings ingested into the Anthology.  In this way we can try to minimize&lt;br /&gt;
the lag between publication and appearance in the Anthology (when ACL is the&lt;br /&gt;
sole publisher).&lt;br /&gt;
&lt;br /&gt;
ONGOING ACTIVITIES:&lt;br /&gt;
&lt;br /&gt;
HIGHER-QUALITY BIBLIOGRAPHIC METADATA: The ACM Digital Library is creating&lt;br /&gt;
high-quality bibliographic metadata for each individual paper, in conjunction&lt;br /&gt;
with registering each paper with a DOI.  It should be possible to extract that&lt;br /&gt;
metadata and improve the quality of metadata on the Anthology site (e.g.,&lt;br /&gt;
removing OCR errors in the spelling of author and paper names).  I will also be&lt;br /&gt;
manually verifying and editing records in the Anthology on a regular and&lt;br /&gt;
systematic basis.&lt;br /&gt;
&lt;br /&gt;
WIKIFIED EDITING: I plan to bring the metadata of the Anthology into a Wiki&lt;br /&gt;
form that allows editing to be easily done by the general public.  I plan to&lt;br /&gt;
start with a pilot data and user set -- the SIG pages -- and expand the program&lt;br /&gt;
if it is successful and poses limited security problems.  I plan to roll this&lt;br /&gt;
out on a trial basis in late 2008.&lt;br /&gt;
&lt;br /&gt;
INTEGRATION WITH OTHER GRASSROOTS PROJECTS: A number of grassroots projects as&lt;br /&gt;
proposed at ACL 2007 center around the Anthology.  I plan to organize and&lt;br /&gt;
incorporate as much user contributed data as possible, where feasible.  These&lt;br /&gt;
would include the Anthology Network, Video Anthology and the Linked Anthology&lt;br /&gt;
proposals.  Thus far, part of the Anthology Network and the raw text extraction&lt;br /&gt;
has dovetailed together nicely on a standardized subset of the Anthology.&lt;/div&gt;</summary>
		<author><name>StevenBird</name></author>
	</entry>
</feed>