Moses-support Digest, Vol 113, Issue 47

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. New parallel corpus release: OpenSubtitles2016 (Jorg Tiedemann)
2. Final CfP EAMT 2016 (deadline March 25th) (Antonio Toral)
3. failed updating 1 target... (Zhanwang Chen)


----------------------------------------------------------------------

Message: 1
Date: Tue, 15 Mar 2016 20:47:40 +0200
From: Jorg Tiedemann <tiedeman@gmail.com>
Subject: [Moses-support] New parallel corpus release:
OpenSubtitles2016
To: moses-support <moses-support@mit.edu>
Message-ID: <4E7B74D9-D02D-4C62-9B70-6C0294527EFD@gmail.com>
Content-Type: text/plain; charset="utf-8"

OpenSubtitles2016

We just released a major update of the parallel subtitle corpus in OPUS:
http://opus.lingfil.uu.se/OpenSubtitles2016.php <http://opus.lingfil.uu.se/OpenSubtitles2016.php>

2.8 million subtitle files in 60 languages with a total of over 17 billion tokens in 2.6 billion sentences and sentence fragments.
As usual in OPUS all languages are sentence-aligned creating a total of 1,689 bitexts.
The data sets are provided in standalone XML format with standoff sentence alignment, TMX and aligned plain text format (often used in training SMT models).

More information is available in:
Pierre Lison and J?rg Tiedemann, 2012, OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
http://stp.lingfil.uu.se/~joerg/paper/opensubs2016.pdf <http://stp.lingfil.uu.se/~joerg/paper/opensubs2016.pdf>


In addition, we also provide intra-lingual alignments between alternative subtitles in the same language:
http://opus.lingfil.uu.se/OpenSubtitles2016alt.php <http://opus.lingfil.uu.se/OpenSubtitles2016alt.php>

More information about those alignments and how they are sorted into various categories can be found in:
J?rg Tiedemann, 2012, Finding Alternative Translations in a Large Corpus of Movie Subtitles.
In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
http://stp.lingfil.uu.se/~joerg/paper/lrec2016.pdf <http://stp.lingfil.uu.se/~joerg/paper/lrec2016.pdf>

Note, that all data sets are automatically created using various pre-processing and alignment tools.
There will be problems at various levels. Feedback is very welcome!


Other new data sets in OPUS:

News Commentary version 11 (originally provided by CASMACAT):
http://opus.lingfil.uu.se/News-Commentary11.php <http://opus.lingfil.uu.se/News-Commentary11.php>
Different to the original source, this release is truly multilingual with alignments across all languages.

Global Voices (also provided by CASMACAT):
http://opus.lingfil.uu.se/GlobalVoices.php <http://opus.lingfil.uu.se/GlobalVoices.php>
Again, this version is multilingual.

Wikipedia:
A corpus of parallel sentences extracted from Wikipedia by Krzysztof Wo?k and Krzysztof Marasek. More information: Krzysztof Wo?k and Krzysztof Marasek: Building Subject-aligned Comparable Corpora and Mining it for Truly Parallel Sentence Pairs., Procedia Technology, 18, Elsevier, p.126-132, 2014
http://www.sciencedirect.com/science/article/pii/S2212017314005453 <http://www.sciencedirect.com/science/article/pii/S2212017314005453>


For more information on OPUS:
http://opus.lingfil.uu.se/index.php <http://opus.lingfil.uu.se/index.php>
Select the language pair you are interested in to see all resources that are available for that particular language pair.
Data formats are explained here: http://opus.lingfil.uu.se/trac <http://opus.lingfil.uu.se/trac>

Enjoy!


**********************************************************************************
J?rg Tiedemann
Department of Modern Languages http://www.helsinki.fi/~tiedeman/ <http://www.helsinki.fi/~tiedeman/>
University of Helsinki

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160315/729cf33d/attachment-0001.html

------------------------------

Message: 2
Date: Wed, 16 Mar 2016 11:12:32 +0100
From: Antonio Toral <atoral@computing.dcu.ie>
Subject: [Moses-support] Final CfP EAMT 2016 (deadline March 25th)
To: apertium-stuff@lists.sourceforge.net, moses-support@mit.edu
Message-ID: <56E93190.3000107@riseup.net>
Content-Type: text/plain; charset=utf-8; format=flowed

**************************************************************************
Final Call for Papers
EAMT 2016, Riga, Latvia - http://eamt2016.tilde.com/
19th
Annual Conference of the European Association for Machine Translation
**************************************************************************


*Deadline* March 25th, 2016
*Keynote speaker* Alexander Fraser (CIS, LMU Munich), Translation to
Morphologically Rich Languages


The European Association for Machine Translation (EAMT,
http://www.eamt.org) invites everyone interested in machine translation,
translation-related tools and resources to participate in this
conference ? developers, researchers, users, professional translators
and translation/localisation managers: anyone who has a stake in the
vision of an information world in which language barriers and issues
become less visible to the information consumer. We especially invite
researchers to describe the state of the art and demonstrate their
cutting-edge results, and professional MT users to share their experiences.

EAMT 2016, the 19th Annual Conference of the European Association for
Machine Translation, will be held in Riga, Latvia, from May 30th to June
1st, 2016.

We expect to receive manuscripts in these three categories:


-------------------
(R) Research papers
-------------------

Long-paper submissions (12 pages) are invited for reports of significant
research results in any aspect of machine translation and related areas.
Such reports should include a substantial evaluation component, or have
a strong theoretical and/or methodological contribution where results
and in-depth evaluations may not be appropriate. Papers are welcome on
all topics in the areas of machine translation and translation-related
technologies, including:

* Advances in various MT paradigms: data-driven, rule-based, and hybrid
approaches
* Technologies for MT deployment: quality estimation, domain adaptation,
etc.
* MT in special settings: low resources, massive resources, high volume,
low computing resources
* MT applications: translation/localisation aids, speech-to-speech,
speech-to-text, OCR, MT for user generated content (blogs, social
networks), etc.
* Linguistic resources for MT: dictionaries, terminology, corpora, etc.
* MT evaluation techniques and evaluation results
* Human factors in MT and user interfaces
* Related multilingual technologies: natural language generation,
information retrieval, text categorisation, text summarisation,
information extraction, etc.

Papers should describe original work. They should emphasise completed
work rather than intended work, and should indicate clearly the state of
completion of the reported results. Where appropriate, concrete
evaluation results should be included.


----------------
(U) User studies
----------------

Short-paper submissions (3-6 pages) are invited for reports on users'
experiences with MT, be it in small or medium size business (SMB),
enterprise, government, or NGOs. Contributions are welcome on:

* Integrating MT and computer-assisted translation into a translation
production workflow (e.g. transforming terminology glossaries into MT
resources, optimizing TM/MT thresholds, mixing online and offline tools,
using interactive MT, dealing with MT confidence scores);
* Use of MT to improve translation or localisation workflows (e.g.
reducing turnaround times, improving translation consistency, increasing
the scope of globalisation projects);
* Managing change when implementing and using MT (e.g. switching between
multiple MT systems, limiting degradations when updating or upgrading an
MT system);
* Implementing open-source MT in the SMB or enterprise (e.g. strategies
to get support, reports on taking pilot results into full deployment,
examples of advanced customisation sought and obtained thanks to the
open-source paradigm, collaboration within open-source MT projects);
* Evaluation of MT in a real-world setting (e.g. error detection
strategies employed, metrics used, productivity or translation quality
gains achieved);
* Post-editing strategies and tools (e.g. limitations of traditional
translation quality assurance tools, challenges associated with
post-editing guidelines);
* Legal issues associated with MT, especially MT in the cloud (e.g.
copyright, privacy);
* Use of MT in social networking or real-time communication (e.g.
enterprise support chat, multilingual content for social media);
* Use of MT to process multilingual content for assimilation purposes
(e.g. cross-lingual information retrieval, MT for e-discovery or spam
detection, MT for highly dynamic content);
* Use of standards for MT.

Papers should highlight problems and solutions and not merely describe
MT integration process or project settings. Where solutions do not seem
to exist, suggestions for MT researchers and developers should be
clearly emphasised. For user papers produced by academics, we require
co-authorship with the actual users.


-------------------------------
(P) Project/Product description
-------------------------------

Abstract submissions (1 page) are invited to report new, interesting:

* Tools for machine translation, computer aided translation, and the
like (including commercial products and open-source software). The
authors should be ready to present the tools in the form of demos or
posters during the conference.
* Research projects related to machine translation. The authors should
be ready to present the projects in the form of posters during the
conference. This follows on from the successful ?project villages? held
at the last EAMT conferences.


---------
Programme
---------

The programme will include oral presentations and poster sessions.
Accepted papers may be assigned to an oral or poster session, but no
differentiation will be made in the conference proceedings.


---------------
Important Dates
---------------

* Paper submission: March 25th, 2016
* Notification to authors: April 22nd, 2016
* Camera-ready deadline: May 2nd, 2016
* Conference: May 30th-June 1st, 2016


------------
Publications
------------

The conference proceedings will be published as a special issue of the
Baltic Journal of Modern Computing (BJMC, http://www.bjmc.lu.lv/), a
scholarly open access electronic quarterly journal, which is indexed by
Thomson Reuters Web of Science Core Collection (Emerging Sources
Citation Index), EBSCO, ProQuest, Directory of Open Access Journals
(DOAJ), Google Scholar, VINITI, Directory of Research Journal Indexing
(DRJI) and Open J-Gate, and has applied to be indexed in Scopus.

In addition, the best accepted papers will be selected to be published,
in an extended version, and with a lighter reviewing process, as regular
papers in the Springer Machine Translation journal
(http://link.springer.com/journal/10590).


-----------
Submissions
-----------

Submissions will be judged on correctness, originality, technical
strength, significance and relevance to the conference, and potential
interest to all attendees. They should mostly contain new material that
has not been presented at any other meeting with publicly available
proceedings.

EAMT 2016 will use electronic submission through the EasyChair
conference tool. To submit a paper, go to the submission website at:
https://easychair.org/conferences/?conf=eamt2016 and follow the
instructions.

Papers that are being submitted in parallel to other conferences or
workshops and papers that contain significant overlap with previously
published work must indicate this on the title page and in the abstract
submitted through EasyChair (using capital letters). In case of
acceptance the paper will only be included in the proceedings if it is
not published in any other conference or workshop to which it was submitted.

Papers should be anonymised (no authors, affiliations or addresses, and
no explicit self-references), be no longer than 12 pages (A4 size) for
research papers, and no longer than 6 pages (A4 size) for user papers,
all in PDF format. Papers must conform to the format defined by the BJMC
template: http://www.bjmc.lu.lv/instructions-to-authors/.
Project/product descriptions do not need to be anonymised and should use
the 1-page template given in
http://eamt2016.tilde.com/sites/eamt2016.tilde.com/files/eamt-2016-product-template.doc.

For further information about this call for papers or if you encounter
any problem regarding submission please contact the track chairs at
eamt2016chairs@tilde.com and put in the subject "[user]" or "[research]"
depending on which track your question is related to. For questions
about the organisation (venue, registration, accommodation, visa,
payments, etc.) please contact the local organisers at eamt2016@tilde.com.


----------------------
EAMT Best Thesis Award
----------------------

The EAMT Best Thesis Award for PhD theses submitted during 2015 will be
awarded at the conference, together with a presentation of the winner?s
work. Information for candidates to the award is available at:
http://www.eamt.org/news/news_best_thesis2015.php. The deadline is the
same as for the paper submission.


------------------
Conference website
------------------

Please visit the conference web page (http://eamt2016.tilde.com/) for
the most up-to-date information about the calendar, the call for papers
and formatting requirements, the programme, invited speakers, related
conference activities, the venue, travel and registration.


---------------------
Conference organisers
---------------------

General Chair: Mikel Forcada (Universitat d?Alacant)

Track Chairs:
* Antonio Toral, Research programme chair (Dublin City University, Ireland)
* Tony O'Dowd, User programme co-chair (KantanMT, Ireland)
* Alexandru Ceausu, User programme co-chair (Amplexor, Luxembourg)

Local Organisation Chair: Andrejs Vasi?jevs (Tilde, Latvia)

Local host: Juris Borzovs (University of Latvia)



------------------------------

Message: 3
Date: Wed, 16 Mar 2016 11:42:54 +0100
From: Zhanwang Chen <zhanwang.c@gmail.com>
Subject: [Moses-support] failed updating 1 target...
To: moses-support@mit.edu
Message-ID:
<CAGdkKyRxum-5Bq-qpUMAP5Zop_JVOQX7SV-YT4qa=Q6ka4bSCQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Dear all,

I am trying to install Moses according to this page:
http://www.statmt.org/moses/?n=Development.GetStarted

installing Boost Manually,
when I reached this step, got error.
./bjam --with-boost=~/workspace/temp/boost_1_55_0 -j4

...skipped
<pmoses/LM/bin/BackwardTest.test/gcc-4.4.7/release/link-static/threading-multi>BackwardTest
for lack of
<putil/bin/gcc-4.4.7/release/link-static/threading-multi>mmap.o...
...skipped
<pmoses/LM/bin/BackwardTest.test/gcc-4.4.7/release/link-static/threading-multi>BackwardTest.run
for lack of
<pmoses/LM/bin/BackwardTest.test/gcc-4.4.7/release/link-static/threading-multi>BackwardTest...
...failed updating 1 target...
...skipped 205 targets...
...updated 614 targets...


I am using centos 6.5
[root@localhost mosesdecoder]# gcc -v
Target: x86_64-redhat-linux
gcc version 4.4.7 20120313 (Red Hat 4.4.7-16) (GCC)

please help.
Thanks a lot.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160316/b626f304/attachment.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: build.log.gz
Type: application/x-gzip
Size: 232 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20160316/b626f304/attachment.gz

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 113, Issue 47
**********************************************

0 Response to "Moses-support Digest, Vol 113, Issue 47"

Post a Comment