2009Q3 Reports: NAACL 2009

From Admin Wiki
Revision as of 10:25, 4 July 2009 by LoriLevin

NAACL HLT Conference Chair Report

As General Chair, my goal for the 2009 NAACL HLT Conference was to have high quality technical presentations in all areas, but particularly to increase the number and quality of papers from the speech and IR areas. In addition, I was encouraged by the NAACL Board to explore ways to better engage researchers from industry.

The ACL-HLT 2008 conference chair raised the issue of imbalance in the speech/IR/NLP fields. There were many more submissions in NLP than in the other two areas. She recommended either changing the structure of committees to reflect the actual balance of papers or having an “additional coordinated event that involves invitations to members of the community in order to draw more participation.” For NAACL HLT 2009, we maintained the area distribution for the area chairs, publicity and tutorials, but not for other committees. For committees with fewer chairs, we tried to choose people whose expertise spanned multiple disciplines, which seemed to work well. It would probably be reasonable to take a similar strategy for the PC chairs next year, i.e. have only 3 PC chairs with 2 from NLP and one from speech, with one of the three having a tie to IR. (We had one person serve as lead for long papers (Mike Collins) and one as lead for short papers (Lucy Vanderwende), and this worked very well.)

In order to try to attract more papers in the speech and IR areas, we included two special sessions targeting the cross-cutting topics of large scale language processing (IR and NLP) and speech indexing and retrieval (speech and IR). The area chairs for the speech indexing session actively recruited paper submissions, which contributed to the session’s success. The large scale language processing area had several submissions without active recruiting other than our highlighting it in the call for papers.

In an attempt to better engage industry researchers, we organized a lunchtime panel discussion on “Emerging Application Areas in Computational Linguistics,” which included representatives from different application areas and companies of different sizes. Thanks to Bill Dolan for organizing and moderating the discussion. Box lunches were available for purchase, and the panel discussion was very well attended. One problem was that the panel started late because of the long line for lunches, which participants assembled themselves. Possible improvements include having the lunches pre-boxed to simplify pickup logistics and moving the lunch pickup outside of the panel discussion room. In addition to the industry panel, another success with industry was that the tutorial chairs actively recruited some more practically oriented tutorials, which were very well received.

Two web pages were available to provide guidance:

  • http://aclweb.org/adminwiki/index.php?title=Conference_Handbook
  • http://www.cs.jhu.edu/~jason/advice/how-to-chair-a-conference

While I should have done a better job in making all chairs aware of the guidelines, our biggest problem (coordinating schedules with other conferences) is not addressed in these guidelines. In addition, it would be nice to have guidance on other HLT-specific issues (e.g. demos). NAACL might want to develop an additional web page covering special considerations for NAACL HLT.

Several changes were instituted this year, including:

  • Multi-conference coordination of sponsorship
  • Multi-conference coordination of workshops
  • New format for short paper reviewing and explicit call for different types of short papers
  • Including the student research workshop as a parallel session within the main conference
  • Allowing students whose papers were accepted to the student session to also (optionally) present a poster in the main conference poster session

All changes seemed to work well, with a couple of exceptions. Most importantly, there was a lack of clarity about the division of responsibility among the different sponsorship chairs, and the regional sponsorship chairs had little connection to the NAACL HLT 2009 meeting. The general idea of multi-conference coordination makes sense, but it is necessary to clarify the role of the local chair and to make this person the main point of contact. Second, as noted in the PC chairs’ report, there were not many papers in the “negative result” and “opinion piece” categories of short papers. The program committee also felt that it would be better to have the author indicate the type of short paper, in addition to the reviewers.

The total raised through sponsorship efforts was $27,350, which includes one $4000 contribution that resulted from the multi-conference sponsorship coordination. The Local Arrangements Chairs were very helpful in this effort, in addition to the Local Sponsorship Chair. Companies contributing included: Rosetta Stone, CNGL, Google, AT&T, Language Weaver, JD Powers, IBM Research, CLEAR, HLT, LDC, and John Benjamins. One difficulty that arose was a last-minute decision that funds from a sponsor would go to a student travel award, which caused some minor program glitches. We recommend that an advance deadline be set for designating donated funds for new awards.


Based on discussions during and after the conference, additional areas where we think there could be improvements include:

  • Parallel review of short papers: A possible reason for serial review was to allow papers submitted as long papers and rejected to be resubmitted as short papers. Very few short papers are accepted this way, and authors can always submit to other conferences. Parallel review would make scheduling much easier and would make the reviewing process less complicated for the program committee.

  • Demos: We had very few demos this year, but many people like the tradition. Ideas for addressing this include either assigning the Demo Chairs the task of actively recruiting demos or giving authors of accepted papers the opportunity to present a demo.

  • Poster session: The evening poster session combined with a reception was well received by poster viewers, but less so by poster presenters. People who had posters in the first session didn’t get much food, and people who had their posters in the later session had a smaller audience. Some ideas for improving this include: having a better enforced time period for poster presenters to eat, having some overlap of the two time slots so the second session runs less late, grouping posters on related topics in the same area, and including an introductory session where poster presenters give a 1-minute pitch on their poster.

  • Workshops: One of the workshop chairs should be affiliated with the local (hosting) institution, since a lot of local arrangements issues arise with the workshops.

Other suggestions are included in the reports from other chairs. While there were several areas for improvement, overall, I consider the conference to be a success. There were roughly 700 participants, and the quality of the tutorials, presentations and workshops was high. The local arrangements were terrific. I am indebted to all the chairs involved in the organization and to the NAACL Board for their support. While there remains an imbalance between NLP, speech and IR, I am encouraged by the quality of the papers that were included. I strongly support continued efforts to include these different areas of HLT and make it possible for researchers to benefit from the insights of these related fields.

Mari Ostendorf, University of Washington, General Chair


NAACL 2009 Program Chairs Report

In 2009 the NAACL HLT program continued to include high-quality work in the areas of computational linguistics, information retrieval, and speech technology. The program included full papers, short papers, demonstrations, a student research workshop, pre-conference tutorials, and post-conference workshops. The call for papers included solicitation of papers for 2 special sessions, “Large-Scale Language Processing” and “Speech Indexing and Retrieval”. This year, 260 full papers were submitted, of which 75 papers were accepted (giving a 29% acceptance rate); and 178 short papers were submitted, of which 71 were accepted (giving a 40% acceptance rate). All full papers were presented as talks; this contrasts with some previous years, e.g., ACL-08 HLT, where some full papers were presented as posters. Of the short papers, 35 were presented as talks, with the remainder being presented as posters. A full breakdown of the statistics by area is presented at the end of this report.
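
As a quick check on the figures above, the acceptance rates and the number of short papers presented as posters can be recomputed from the reported counts. The following Python sketch is illustrative only: the function and variable names are hypothetical, and the only inputs are the counts quoted in the paragraph above.

```python
def acceptance_rate(accepted: int, submitted: int) -> int:
    """Return the acceptance rate as a whole-number percentage."""
    return round(100 * accepted / submitted)


# Counts taken from the report text (full and short papers).
full_submitted, full_accepted = 260, 75
short_submitted, short_accepted = 178, 71
short_talks = 35

full_rate = acceptance_rate(full_accepted, full_submitted)
short_rate = acceptance_rate(short_accepted, short_submitted)

# Short papers not presented as talks were presented as posters.
short_posters = short_accepted - short_talks

print(f"Full papers: {full_rate}% accepted")
print(f"Short papers: {short_rate}% accepted, {short_posters} as posters")
```

Rounding to whole percentages reproduces the 29% and 40% rates quoted in the report.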

This year, short papers of five types were solicited: “a small, focused contribution”, “work in progress”, “a negative result”, “an opinion piece”, or “an interesting application note”. It was a reviewer task to determine which paper type a short paper best belonged to; alternatively, this could be a check-box at submission time. In practice, the largest agreement among reviewers was found in the “small, focused contribution” category, the traditional type of short paper submitted to NAACL HLT (119/178). A majority of reviewers thought that 38/178 papers were “work in progress” and that 10/178 were “interesting application note”. There were only a handful of papers submitted that any of the reviewers considered to be a “negative result” or “opinion piece”. It will take more than one conference cycle to determine the field’s interest in writing, and then accepting, such paper types.

Reviewing was organised in a two-tier system, with eighteen senior program committee (SPC) members (“area chairs”), who in turn recruited 352 reviewers. The SPC members managed the review process for both the full and short paper submissions: each full paper received at least three reviews, and each short paper received at least two reviews. As in recent years, we did not have a face-to-face meeting of the area chairs; instead, we held a series of teleconferences between individual area chairs and the PC chairs. The START conference management system was used to manage paper submissions and the review process; Rich Gerber and the START team gave invaluable help with the system.

Two best paper awards were given at the conference. The senior program committee members for the conference nominated an initial set of papers that were candidates for the awards; the final decisions were then made by a committee chaired by Candace Sidner, and with Hal Daume III, Roland Kuhn, Ryan McDonald, and Mark Steedman as its other members.

Michael Collins, Massachusetts Institute of Technology
Shri Narayanan, University of Southern California
Douglas W. Oard, University of Maryland
Lucy Vanderwende, Microsoft Research

Table 1: Statistics for full paper submissions.


Area                                            Submissions  Acceptances (Talk)
Sentiment/Information Extraction                34           8 (24%)
Discourse                                       10           4 (40%)
Generation/Summarization                        22           2 (9%)
Machine learning                                28           8 (29%)
Phonology/Morphology/Language acquisition       14           5 (36%)
Semantics                                       32           7 (22%)
Syntax                                          33           11 (33%)
Machine translation                             37           13 (35%)
Dialog                                          9            3 (33%)
IR/Question answering                           16           4 (25%)
Large-scale language processing                 11           3 (27%)
Speech indexing and retrieval                   1            0 (0%)
Speech/Spoken Language Processing Algorithms    9            4 (44%)
Speech/Spoken Language Processing Applications  8            3 (38%)


Table 2: Statistics for short paper submissions.

Area                                       Submissions  Acceptances (talk)  Acceptances (poster)
Sentiment/Information Extraction           22           3 (14%)             4 (18%)
Discourse                                  7            -                   2 (29%)
Generation/Summarization                   10           2 (20%)             3 (30%)
Machine Learning                           14           2 (14%)             2 (14%)
Phonology/Morphology/Language Acquisition  4            -                   1 (25%)
Semantics                                  17           4 (24%)             -
Syntax                                     16           3 (19%)             4 (25%)
Machine Translation                        30           8 (27%)             5 (17%)
Dialog                                     11           2 (18%)             3 (27%)
IR/Question answering                      16           3 (19%)             3 (19%)
Large Scale Processing                     6            1 (17%)             2 (33%)
Speech Indexing and Retrieval              7            5 (71%)             -
Speech/Spoken Language Algorithms          9            1 (11%)             5 (56%)
Speech/Spoken Language Applications        9            1 (11%)             2 (22%)


Report from Student Research Workshop/Doctoral Consortium

The Student Research Workshop provided a venue for student researchers investigating topics in the broad fields of Computational Linguistics and Language Technologies to present their work and receive feedback from the community. The workshop was composed of three parallel tracks in Natural Language Processing, Information Retrieval, and Speech.

We received a total of 29 submissions (4 IR, 4 Speech, 21 NLP) from 11 countries. Submissions were up from last year, although only after we extended the deadline twice, so we wonder whether there may be too many similar venues competing for submissions, such as the ACL SRW and the EACL SRW. Considering the uneven distribution of submissions across the three tracks, the topical organization of the workshop into tracks should also be rethought.

Of the 29 submissions, we accepted 9 as oral presentations (one of which was withdrawn from the workshop) and another 9 as poster presentations. Accepted oral presentations and posters came from 9 different countries. Both oral presentation and poster presentation sessions were scheduled during the main conference; each paper accepted for oral presentation was also given a slot in the poster session.
We made a special effort to schedule the sessions at times when many senior people in the field would be able to attend and offer the wisdom of their years of experience.

A total of 86 students and senior researchers agreed to serve on the program committee, which allowed us to assign 4 to 6 reviewers per paper. During the workshop, each oral presentation was followed by a brief panel discussion by two panelists per paper. Despite the extra effort of recruiting panelists, we believe that the panels added considerable value to the workshop. Not only did they ensure good feedback for the presenters, they also helped the audience put the papers into perspective within the respective research fields. Each of the three oral presentation sessions drew an audience of 30 to 50 people.

All presenters received financial support from the U.S. National Science Foundation to assist them in their travel to Boulder for the conference. Altogether we received $21,000 from the National Science Foundation to fund the workshop, which included support for student participants, student co-chairs, and the cost of the student lunch. Oral presenters were offered $400 to defray the cost of registration and hotel as well as $500 to cover travel from within North America, or $1000 if they were traveling internationally. Poster presenters were offered $300 total for reimbursement. We also budgeted a small amount of money for materials, such as poster boards for the poster presentations.

At the student lunch on the day of the SRW, we had a group discussion to get feedback from the student community on how the SRW could be improved, and in general what could be done to offer mentoring to the students in our community. One issue that was raised is that it is not very clear what sets the SRW apart from the main conference or exactly what types of submissions are desired.
Students felt that some of the feedback they received from reviewers wasn’t consistent with what the call for participation described as the target submissions. In response, we may need a more structured review form. One participant in the discussion pointed out that the form used for ACL short papers this year was a particularly good example of how to keep reviewers thinking along the right lines for review. Another issue that was raised is that students are not getting encouragement from their advisors to submit to the SRW, so we may need to go back to the faculty segment of the ACL community to find out why and what we can do about it.

Students expressed a desire for more networking opportunities at ACL conferences, especially to help the shyer students come out of their shells. One idea was to organize topic-specific round tables that selected faculty would attend but whose participants would be mainly students interested in similar topics. Other ideas included websites to help students find roommates for conferences, and distributing contact information for everyone registered by a particular date, along with their affiliations and research interests, to help students plan whom they want to meet during the conference.

Student Research Workshop Faculty Chairs and Student Co-Chairs

  • Anoop Sarkar (Faculty Chair, Simon Fraser University)
  • Carolyn Rose (Faculty Chair, CMU)
  • Svetlana Stenchikova (Student Co-Chair, Stony Brook University) - Speech
  • Ulrich Germann (Student Co-Chair, University of Toronto) - NLP
  • Chirag Shah (Student Co-Chair, University of North Carolina) - Information Retrieval


Brief Reports from Other NAACL HLT Chairs

Publicity Chairs

  • Matthew Stone (Rutgers)
  • Gokhan Tur (SRI)
  • Diana Inkpen (U Ottawa)

The Publicity Chairs were chosen from each of the three HLT areas (IR, speech and NLP) in order to ensure good connections to the communities.
They forwarded all announcements to mailing lists in their respective fields, including the main conference Call for Papers and Call for Short Papers, the Calls for Workshop and Tutorial Proposals, the Call for Demos, and the Doctoral Consortium Call for Papers. The lists and websites used included: corpora, elsenet, acl, asis-l, linguist, webir, ISCA ISCAPad, IEEE eNewsletter, AI Magazine, and the cognitive science society website. Future organizers should bear in mind that many organizations produce bimonthly or quarterly newsletters for conference announcements (e.g., AAAI's AI magazine, IEEE signal processing speech & language technical committee), which require CFPs to be distributed at least 2-3 months in advance of submission deadlines.

Publications Chairs

  • Christy Doran (MITRE)
  • Eric Ringger (BYU)

At the recommendation of the ACL-HLT 2008 chair, we continued the tradition from ACL 2008 of having two Publication Chairs, which seemed to work well. The Chairs followed the recipe written for publications chairs by Joakim Nivre and Noah Smith, located here: http://stp.lingfil.uu.se/~nivre/how-to-pub.html

This includes the updated recipe for using ACLPUB to assemble the actual proceedings: http://faculty.cs.byu.edu/~ringger/naacl09/howto.html

Several improvements are in the queue for both documents as well as the ACLPUB tools. Notes for improvement and discussion:

  • Better publicizing and enforcement of publications-related deadlines.
  • Mailing lists for relevant subsets of the organizing committee, including: PC co-chairs, local organizers, workshop chairs and sponsorship chairs. (There are no sponsor logos in the proceedings, since the sponsorship chairs did not know about the deadline.)
  • Improved documentation of prerequisites for hand-off to OmniPress (e.g. file formats), especially regarding book covers.
  • Per Jan Hajic, there is an opportunity to integrate some of ACLPUB into START.
  • Numbers of needed printed volumes continue to drop. It may be a good time to consider going digital only.
  • Recommendations for shared documentation on aclweb.org?

Tutorials Chairs

  • Ciprian Chelba (Google)
  • Paul Kantor (Rutgers)
  • Brian Roark (OHSU)

The tutorial chairs actively recruited submissions, received 12, and accepted 8. When reviews were “tied,” they erred on the side of accepting rather than rejecting, and felt that it worked out quite well. Even though 8 tutorials is more than in most years (typically 6), we ended up with sufficient enrollment in all 8 accepted proposals. The complete list of tutorials is given below.

  1. Data-Intensive Text Processing with MapReduce -- Jimmy Lin and Chris Dyer (45 participants)
  2. Distributed Language Models -- Thorsten Brants and Peng Xu (32 participants)
  3. Search Algorithms in NLP: Theory and Practice with Dynamic Programming -- Liang Huang (53 participants)
  4. Extracting world/linguistic knowledge from Wikipedia -- Simone Paolo Ponzetto and Michael Strube (34 participants)
  5. OpenFst: An Open-Source, Weighted FST Library -- Martin Jansche/Cyril Allauzen/Michael Riley (24 participants)
  6. OntoNotes: The 90% Solution -- Sameer Pradhan and Nianwen Xue (12 participants)
  7. VerbNet overview, extensions, mappings and apps -- Karin Kipper Schuler, Anna Korhonen, Susan W. Brown (24 participants)
  8. Writing Systems, Transliteration and Decipherment -- Richard Sproat and Kevin Knight (20 participants)

Workshops Chairs

  • Mark Hasegawa-Johnson (UIUC)
  • Nizar Habash (Columbia)

There were 41 workshop submissions jointly to ACL, EACL, and NAACL. ACL accepted 12, EACL accepted 10, and we accepted 11. Eleven plus the student workshop and CoNLL gave a total of 13 workshops, listed below. The number of participants listed is the final estimate from Priscilla Rasmussen as of May 26. For more information, see http://isle.uiuc.edu/hltnaacl2009/.

  1. Semantic Evaluations: Recent Achievements and Future Directions
     Organizers: Eneko Agirre, Lluís Marquez, Richard Wicentowski
     42 participants
  2. BioNLP 2009
     Organizers: Sophia Ananiadou, K. Bretonnel Cohen, Dina Demner-Fushman, John Pestian, Jun'ichi Tsujii, Bonnie Webber
     74 participants
  3. Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies
     Organizers: Sivaji Bandyopadhyay, Pushpak Bhattacharya, Vasudeva Varma, Sudeshna Sarkar, A Kumaran
     14 participants
  4. Workshop on Integer Linear Programming for Natural Language Processing
     Organizers: James Clarke, Sebastian Riedel
     25 participants
  5. Software engineering, testing, and quality assurance for natural language processing
     Organizers: Kevin Bretonnel Cohen, Marc Light
     34 participants
  6. Computational Approaches to Linguistic Creativity
     Organizers: Birte Loenneker-Rodman, Anna Feldman
     34 participants
  7. Unsupervised and minimally supervised learning of lexical semantics
     Organizers: Suresh Manandhar, Ioannis Klapaftis
     34 participants
  8. Semi-supervised Learning for NLP
     Organizers: Qin Wang, Kevin Duh, Dekang Lin
     75 participants
  9. Active Learning for NLP
     Organizers: Eric Ringger, Robbie Haertel, Katrin Tomanek
     35 participants
  10. Innovative Use of NLP for Building Educational Applications
      Organizers: Joel Tetreault, Jill Burstein, Claudia Leacock
      34 participants
  11. Third Workshop on Syntax and Structure in Statistical Translation
      Organizers: Dekai Wu, David Chiang
      48 participants
  12. Thirteenth Conference on Computational Natural Language Learning (CoNLL)
      Organizers: Suzanne Stevenson and Xavier Carreras
      85 participants

The multi-conference proposal system seemed to work very well for all concerned: three workshops that would not have been offered (because they were rejected by their first-choice conference) were, instead, offered at NAACL. All of the workshops had full schedules, as indicated in the online schedule.

Demo Chairs

  • Fred Popowich (Simon Fraser University)
  • Michael Johnston (AT&T)

Six demos were submitted, and five were accepted.
The demo chairs did not actively recruit demos from specific research groups, relying instead on the general publicity efforts. Future organizers might consider opening the demo session to people who have accepted papers. Judging from the demo session itself, the presenters seemed to be happy with how it went, and they were glad to be in the high-traffic area.