Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Data Selection Tool (Amir Kamran)
----------------------------------------------------------------------
Message: 1
Date: Thu, 17 Sep 2015 09:32:15 +0000
From: Amir Kamran <amirkamran@gmail.com>
Subject: [Moses-support] Data Selection Tool
To: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<CAL0aJKiuZm3NbA3mSuKosgaRJOZZ+TendhwF2NT8ke6gnWmx1Q@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
A tool for domain adaptation and data selection for SMT was implemented by
Amir Kamran (SLPL Lab.
<https://staff.fnwi.uva.nl/k.simaan/research_all.html>, ILLC
<https://www.illc.uva.nl/>, University of Amsterdam <http://www.uva.nl/en>)
based on
[Hoang Cuong and Khalil Sima?an.
<https://www.illc.uva.nl/People/show_person.php?Person_id=Hoang+C.> Latent
Domain Translation Models in Mix-of-Domains
<http://www.aclweb.org/anthology/C14-1182> on Computational Linguistics].
The developed tool is available at the following Github repository:
<https://github.com/amirkamran/InvitationModel>
https://github.com/amirkamran/InvitationModel
Description
The Invitation based data selection approach exploits in-domain
data (both monolingual and bilingual) as prior to guide word alignment and
phrase pair estimates in a large mix-domain corpus. Accurate estimates are
obtained for the probability P(D|e,f) of every mixed-domain sentence pair
(e,f) being in-domain (D=1) or out-of-domain (D=0), be used to rank the
sentences in mix-domain according to their relevance to in-domain corpus or
used directly as model features when extracting phrase pairs.
The re-implemenation was conducted at ILLC (Institute for Logic, Language
and Computation, University of Amsterdam) https://www.illc.uva.nl in part
within the project "Data-Powered Domain-Specific Translation Services On
Demand"
<http://www.stw.nl/nl/content/data-powered-domain-specific-translation-services-demand-dataptor>,
supported by the grant "STW Open Technologieprogramma
<http://www.stw.nl/nl/content/open-technologieprogramma>".
-
Amir Kamran is supported by STW project DatAptor
<http://www.stw.nl/nl/content/data-powered-domain-specific-translation-services-demand-dataptor>
grant nr. 12271
-
Hoang, Cuong is supported by EXPERT ITN project <http://expert-itn.eu/>.
-
Sima'an, Khalil (2014) is partly supported by NWO Vici project
<https://www.illc.uva.nl/AbouttheILLC/Activities/grants.php> grant nr.
277-89-002.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150917/c089ad6b/attachment-0001.html
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 107, Issue 43
**********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 107, Issue 43"
Post a Comment