Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Call for Participation: Shared Task on Parallel Corpus
Filtering (WMT18) (Philipp Koehn)
----------------------------------------------------------------------
Message: 1
Date: Wed, 4 Apr 2018 18:05:42 -0400
From: Philipp Koehn <phi@jhu.edu>
Subject: [Moses-support] Call for Participation: Shared Task on
Parallel Corpus Filtering (WMT18)
To: wmt-tasks@googlegroups.com, "corpora@uib.no" <CORPORA@uib.no>,
Moses Support <moses-support@mit.edu>, Multiple recipients of list
<mt_list@nist.gov>
Message-ID:
<CAAFADDBvjQ1rPWju8EoAQqKRpGudsoi8o+EkqpRDsg1oE6RbwQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
CALL FOR PARTICIPATION
*Shared Task: Parallel Corpus Filtering*
at the Third Conference on Machine Translation (WMT18)
http://statmt.org/wmt18/parallel-corpus-filtering.html
This new shared task tackles the problem of cleaning noisy parallel
corpora. Given a noisy parallel corpus (crawled from the web), participants
develop methods to filter it to a smaller size of high quality sentence
pairs.
*DETAILS*
We provide a very noisy 1 billion word (English token count) German-English
corpus crawled from the web as part of the Paracrawlproject. We ask
participants to subselect sentence pairs that amount to (a) 100 million
words, and (b) 10 million words. The quality of the resulting subsets is
determined by the quality of a statstical machine translation (Moses,
phrase-based) and neural machine translation system (Marian) trained on
this data. The quality of the machine translation system is measured by
BLEU score on the (a) official WMT 2018 news translation test set and (b)
another undisclosed test set.
*IMPORTANT DATES*
Release of raw parallel data: April 1, 2018
Submission deadline for subsampled sets: June 22, 2018
System descriptions due: July 6, 2018
Announcement of results: July 9, 2018
Camera-ready for system descriptions: July 27, 2018
*ORGANIZERS*
Philipp Koehn (Johns Hopkins University / University of Edinburgh)
Huda Khayrallah (Johns Hopkins University)
Kenneth Heafield (University of Edinburgh)
Mikel Forcada (University of Alicante)
*ACKNOWLEDGEMENTS*
This shared task is partially supported by a Google Faculty Research Award
and the Connecting Europe Facility via the Paracrawl project.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20180404/ef345dae/attachment-0001.html
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 138, Issue 1
*********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 138, Issue 1"
Post a Comment