Moses-support Digest, Vol 115, Issue 28

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. error while pruning (Sanjanashree Palanivel)
2. Official release of the United Nations Parallel Corpus v1.0
(Marcin Junczys-Dowmunt)


----------------------------------------------------------------------

Message: 1
Date: Wed, 25 May 2016 14:12:19 +0530
From: Sanjanashree Palanivel <sanjanashree@gmail.com>
Subject: [Moses-support] error while pruning
To: moses-support@mit.edu
Message-ID:
<CAAc_kp6nBkDQj4Lf_H-FdYbLaXZpBa=R0fqFMFacWxb=vA_R1w@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi,

I tried pruning to two phrase-table files and I receive plenty of error
messages in one file:

No occurrences found!!

and the pruned table is just too small to be true.
Whereas the another phrase-table is getting pruned properly.
What might be the error?


--
Thanks and regards,

Sanjanasri J.P
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160525/2360419a/attachment-0001.html

------------------------------

Message: 2
Date: Wed, 25 May 2016 12:32:51 +0200
From: Marcin Junczys-Dowmunt <junczys@amu.edu.pl>
Subject: [Moses-support] Official release of the United Nations
Parallel Corpus v1.0
To: moses-support <moses-support@mit.edu>, mt-list@eamt.org,
"Pouliquen, Bruno" <bruno.pouliquen@wipo.int>, "Michal ZIEMSKI
(MZIEMSKI@unog.ch)" <MZIEMSKI@unog.ch>
Message-ID: <57457F53.9070600@amu.edu.pl>
Content-Type: text/plain; charset=utf-8; format=flowed

Dear all,
I would like to announce the official release of the United Nations
Parallel Corpus v1.0. The corpus was created as part of the United
Nations commitment to multilingualism and as a reaction to the growing
importance of statistical machine translation (SMT) within the
Department for General Assembly and Conference Management (DGACM)
translation services and the United Nations. It covers 25 years, from
1990 to 2014, and contains documents in the six official languages of
the United Nations: Arabic, Chinese, English, French, Russian, and Spanish.

The purpose of the corpus is to allow access to multilingual language
resources and facilitate research and progress in various natural
language processing tasks, including machine translation. For
convenience, the corpus is also available pre-packaged as bi-texts for
each language pair.

A subset of the corpus is available as a six-language fully-parallel
corpus, i.e. all sentences have equivalents in all six languages. Data
from 2015 has been used to created official development sets and test
sets, also fully aligned across the six official UN languages. The paper
reports SMT baselines for all languages pairs for this corpus.

The corpus is available at:

http://conferences.unite.un.org/UNCorpus

The corresponding publication is available at:

http://www.lrec-conf.org/proceedings/lrec2016/pdf/1195_Paper.pdf

While registering, please leave a short description of the work for
which you plan to use the corpus. In the near future we plan to set up a
section with references to papers that describe research done with UN
corpus. Feel free to share links and bibliography items with us (either
with me or any of the authors of the above paper).

Sorry for cross-posting,
Marcin Junczys-Dowmunt



------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 115, Issue 28
**********************************************

0 Response to "Moses-support Digest, Vol 115, Issue 28"

Post a Comment