
My little Chelsea supporters, and
.......................... the view from my office window, with
thanks!
I am an Associate Professor in the School of Computing at Dublin City University, and have been here since 1991. I am also the School Research Convenor.
I am the Editor for the journal Machine Translation. Please contact me if you would like to submit to the journal.
I am also on the EAMT Committee, from the EAMT-09 conference as the new President of EAMT. DCU also act as Webmaster for EAMT.
Following the MT Summit 2009, I am now the Vice-President for the International Association for Machine Translation, from 2009--2011.
I am also a member of the National Centre for Language Technology (NCLT), and am a member of the Language and Intelligence Research group in our School.
I am also the track leader for Integrated Language Technologies in the Centre for Next Generation Localisation, aiming at facilitating optimal multilingual applications for deployment in the localisation industry.
Recent News
Two new journal
publications:
- Metric and Reference Factors in Minimum Error
Rate Training: Yifan He and Andy Way, Machine Translation
- An Incremental Three-pass System Combination
Framework by Combining Multiple Hypothesis Alignment Methods: Jinhua Du and
Andy Way, International
Journal of Asian Language Processing
We had a paper in the
recent EBMT-3
workshop, entitled Hybrid Rule-Based -- Example-Based MT: Feeding Apertium with Sub-sentential Translation Units, representing joint work with Felipe Sánchez-Martínez and Mikel Forcada.

I'm on the programme
committee for ACL 2010, which will
be held in Uppsala, Sweden from 11-16 July, 2010.

I'm on the programme
committee for COLING 2010, which will
be held in Beijing from Aug 23-27, 2010.

I've just been invited to the third Google Faculty Summit, to take place in Zurich 8-10 February 2010. The Natural Language Technologies stream of the 2010 Europe, Middle East and Africa (EMEA) Faculty Summit
will address topics at the intersection of research in Natural Language
Understanding and applied techniques for scalable Natural Language
Processing.

I'm on the programme
committee for NAACL-HLT 2010, which will be held in Los
Angeles from June 1-June 6th, 2010
We've just
had three papers accepted for PACLIC 23, the 23rd Pacific Asia Conference on Language, Information and Computation, to be held 3--5 Dec at the City University of Hong Kong. The papers are as follows:
- Experiments on Domain Adaptation for English--Hindi
SMT: Rejwanul Haque, Sudip Kumar Naskar, Josef van Genabith and Andy Way
- Capturing lexical variation in MT evaluation using automatically built
sense-cluster inventories: Marianna Apidianaki, Yifan He and Andy Way
- Dependency Relations as Source Context in Phrase-Based
SMT: Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch and Andy Way
We've just
had a paper accepted for presentation at the project exhibition at the
European
Conference on Machine Learning and Principles and Practice of
Knowledge Discovery in Databases (ECML PKDD), which will take
place in Bled, Slovenia, from September 7th to 11th, 2009. The paper
is entitled Noise Reduction Experiments in Machine Translation,
and is joint work with Tsuyoshi Okita and Sudip Naskar.

We've
just had a paper accepted for presentation at IALP 2009, the
latest in the series of International Conferences on Asian Language
Processing. This year's conference will take place on 7-9 December in Singapore. The paper is
entitled A Three-pass System Combination Framework by Combining Multiple
Hypothesis Alignment Methods, and is joint work with Jinhua Du.

Mikel
Forcada, in residence at the CNGL for the year, and I are co-organising the 3rd Workshop on Example-Based
Machine Translation with the theme Going open-source to revive EBMT at DCU November 12-13, 2009.
We've just heard that two of our papers have been accepted for publication in Machine Translation. The first paper is entitled Bidirectional Data-Driven Machine Translation for Irish and German Sign Languages, and represents joint work with Sara
Morrissey. The second paper is entitled Parallel
Treebanks and their Exploitability in Machine Translation, and represents joint work with John Tinsley.
We've just had a paper
accepted for presentation at Interspeech 2009, the 10th Annual Conference
of the International
Speech Communication
Association, which will take place on
6-10 September, 2009 in Brighton, UK. The paper is
entitled Using Same-Language Machine Translation to Create Alternative
Target Sequences for Text-To-Speech Synthesis, and is joint work with
Peter Cahill and Julie Berndsen of UCD, and Jinhua Du of DCU.
We've just had four papers
accepted for presentation at MT Summit 2009, the Twelfth Machine
Translation Summit, organized by the International Association for Machine
Translation and the Association for Machine Translation in the Americas, which will
be held at the Château Laurier, Ottawa, Canada, 26-30 August 2009. The papers are as follows:
- Source-Side Context-Informed Hypothesis Alignment for
Combining Outputs from Machine Translation Systems: Jinhua Du and Andy Way
- Tracking Relevant Alignment Characteristics for
Machine Translation: Patrik Lambert, Yanjun Ma, Sylwia Ozdowska
and Andy Way
- Improving the Objective Function in Minimum Error Rate
Training: Yifan He and Andy Way
- Using Percolated Dependencies for Phrase Extraction in
SMT: Ankit Srivastava and Andy Way
We've just had a paper accepted for presentation at RANLP 2009, the International Conference on Recent Advances in Natural Language Processing, to take place September 14-16, 2009, in Borovets, Bulgaria. The paper is entitled Lexicalized Semi-Incremental Dependency Parsing and is joint work with Hany Hassan and Khalil Sima'an.
We've just had two papers accepted for presentation at EMNLP,
which will be held on August 6--7 at the Suntec Singapore International Convention & Exhibition Centre, immediately following ACL/IJCNLP 2009. The first paper is entitled A Syntactified Direct Translation Model with Linear-time Decoding and is joint work with Hany Hassan and Khalil Sima'an. The second paper is entitled Accuracy-Based Scoring for DOT: A Step Towards Evaluation Measure Based MT Training and is joint work with Sergio Penkale and Daniel Galron, a student of Dan Melamed's who recently completed a placement in DCU funded by CNGL.
We've just had a paper accepted for presentation at the Named Entities Workshop (NEWS) 2009, to take place on 7th August, 2009 at ACL/IJCNLP 2009. The paper is entitled English--Hindi Transliteration Using Context-Informed PB-SMT: the DCU System, and is joint work with Rejwanul Haque, Ankit Srivastava and Sudip Naskar.
I'm on the programme
committee for IWSLT 2009, which will be held in Tokyo, Japan, on December 1-2, 2009. This year IWSLT-09 will be co-located with IUCS 2009.
x
At the EAMT-09 conference in Barcelona, I was elected as the new President of EAMT, following the sterling service provided to our community in this regard by Bente Maegaard.
We had a paper accepted for presentation at SETQA-NLP 2009, a workshop at HLT-NAACL 2009, on "Software engineering, testing, and quality assurance for natural language processing". This will take place in Boulder, Colorado, on Friday, June 5th. The paper is entitled Web service integration for next generation localisation, and represents joint work with David Lewis, Stephen Curran, Kevin Feeney, Zohar Etzioni, John Keeney (all TCD) and Reinhard Schäler (University of Limerick).
I was on the programme
committee for EMNLP,
which will be held on August 6--7 at the Suntec Singapore International Convention & Exhibition Centre, immediately following ACL/IJCNLP 2009.
I was on the programme
committee for MT Summit XII, which will take place from August 25--29 2009 at the Chateau Laurier in Ottawa, Ontario, Canada.
We had five papers accepted for presentation at EAMT-09, the 13th Annual Meeting of the European Association for Machine Translation, which will take place May 14-15 at UPC in Barcelona, Spain. The five papers are as follows:
- Using Supertags as Source Language Context in SMT: Rejwanul Haque, Yanjun Ma, Sudip Naskar, Andy Way
- Tuning Syntactically Enhanced Word Alignment for Statistical Machine Translation: Yanjun Ma, Patrik Lambert, Andy Way
- Learning Labelled Dependencies in Machine Translation Evaluation: Yifan He and Andy Way
- Marker-based Filtering of Bilingual Phrase Pairs for SMT: Felipe Sánchez-Martinez and Andy Way
- Optimal Bilingual Data for French-English PB-SMT: Sylwia Ozdowska and Andy Way
I gave a keynote at the 2nd International Conference on
Arabic Language Resources and Tools run by the MEDAR project, to take place on 22-23 April 2009 at the Hotel Grand Hyatt Cairo, Egypt.
We had a paper accepted for presentation in the Special Issue on Machine Translation of Asian
Languages of ACM TALIP. The paper is entitled Bilingually Motivated Word Segmentation for
Statistical Machine Translation, and is joint work with Yanjun Ma.
We had a paper
accepted for presentation at the Fourth Workshop on Statistical
Machine Translation, EACL 2009, to take place in Athens, Greece, on March 30 and 31, 2009. The paper is entitled MaTrEx: the DCU MT System for WMT
2009, and is joint work with Jinhua Du, Yifan He and Sergio Penkale.
We
had a paper accepted for presentation at CICLING 2009, to be held in Mexico
City from March 1-7, 2009. The paper is entitled Parallel Treebanks in Phrase-Based
Statistical Machine Translation and is joint work with John Tinsley and
Mary Hearne.
We
had a paper accepted for presentation at EACL-09 to be held in Athens, Greece, from March 30-April 3, 2009. The paper is entitled Bilingually Motivated Domain-Adapted Word Segmentation
for Statistical Machine Translation and is joint work with Yanjun Ma.
I was on the
Programme Committee for the Joint
ACL-IJCNLP 2009 conference to be held in Singapore on August 2-7, 2009.
We just submitted our system description paper for the NLP Tools Contest: Statistical Machine Translation (English to Hindi), part of the 6th International Conference on Natural Language Processing in Pune, India, to be held on 20-22 December 2008.
I attended
the JHU 2009
Summer Workshop Planning Meeting this weekend Nov 7--9.
Khalil Sima'an and I put together an proposal to incorporate
syntax into today's PB-SMT systems. We came fifth ...
I was on the
Programme Committee for the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL-09) to be held in Athens, Greece, from March 30-April 3, 2009.
New recruits
to our team include Marianna Apidianiki (Jussieu, Paris: postdoc),
Anton Bryl (Trento: postdoc),
Yifan He (Tsinghua Univ., Beijing: PhD), Rejwanul Haque (Jadavpur
Univ., Kolkata: PhD), and Sandipan Dandapat
(IIT-Hyderabad: PhD). Welcome all!
We
had a paper accepted for presentation at the Second IEEE Spoken Language Technology (SLT) workshop to be held in Goa, India in December 2008. The paper is entitled A Syntactic Language Model based on Incremental CCG Parsing, and is joint work with Hany Hassan and Khalil Sima'an.
I was on the organizing committee (as one of the publicity chairs) for ACL-IJCNLP to take place in Singapore 2-7 August 2009.
Older Items
News from 2008, Old News from 2007, Older News from 2006,
Even Older News from 2004 & 2005, and Ancient News from 2003 (hardly 'news' at all now!)!