SIGDAT - 2007 Summer Report

SIGDAT is ACL's special interest group for linguistic data and corpus-based approaches to NLP.

In 2007, SIGDAT is organizing a 3-day Joint Conference on Empirical Methods in Natural Language Processing and Conference on Computational Natural Language Learning (EMNLP-CoNLL 2007) with our sister SIG, SIGNLL. The meeting is scheduled immediately after ACL-07 in Prague on June 28-30. Jason Eisner is program chair, Jan Hajic is local arrangements chair, and Eric Ringger is publications chair. The conference appears to be highly successful: over 400 submissions were received, of which 66 papers were accepted for oral presentation and 44 as posters, yielding a total acceptance rate of approximately 25% (and 15% acceptance for full oral presentation). The proceedings exceed 800 pages, and essentially the entire conference will be held in parallel sessions. On several dimensions of scale, EMNLP is now regularly similar in size to recent NAACL/HLT and EACL meetings.

In 2006, SIGDAT organized a 2-day Conference on Empirical Methods in Natural Language Processing (EMNLP-2006), held immediately after ACL-06 in Sydney on July 22-23. Dan Jurafsky and Eric Gaussier were program chairs. As in this year, over 400 submissions were received; 43 full papers and 30 posters were accepted, yielding an acceptance rate under 20%. The proceedings exceeded 600 pages.

As one of ACL's first SIGs, SIGDAT was formed prior to the requirement that SIGs have a constitution. SIGDAT is taking steps to create a constitution and further normalize our structure before the end of the year, consistent with ACL policy. We will also actively pursue the question of the role of EMNLP as it continues to grow, in particular with respect to its scheduling in conjunction with other ACL events.

- David Yarowsky
  Secretary-Treasurer