Moses-support Digest, Vol 126, Issue 30

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: MERT: process n-best list before mert (Hieu Hoang)
2. Re: MERT: process n-best list before mert (Ergun Bicici)


----------------------------------------------------------------------

Message: 1
Date: Sat, 22 Apr 2017 18:41:09 +0100
From: Hieu Hoang <hieuhoang@gmail.com>
Subject: Re: [Moses-support] MERT: process n-best list before mert
To: Jorg Tiedemann <tiedeman@gmail.com>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CAEKMkbhL6M7YJw+cABy6O4mZ6J1Yf4vUKOiKZGRg7vMZaFdSvg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

sounds good. You're always welcome to check it in yourself as long as you
look after it. Send me your github username or create a git pull request

* Looking for MT/NLP opportunities *
Hieu Hoang
http://moses-smt.org/


On 22 April 2017 at 15:44, Jorg Tiedemann <tiedeman@gmail.com> wrote:

>
> Great - thanks. I modified a little to make it possible to pass a command
> to the new option.
> I hope I didn?t break anything. Maybe this modification could become part
> of the official moses package?
>
> J?rg
>
> J?rg Tiedemann
> tiedeman@gmail.com
>
>
>
>
>
>
>
> On 22 Apr 2017, at 15:50, Anoop (?????) <anoop.kunchukuttan@gmail.com>
> wrote:
>
> Hi Jorg,
>
> I had made changes to mert-moses.perl to achieve exactly what you are
> looking for. Please find the script attached.
>
> To enable character level to word level transformation, you have to pass
> the option '--transform-decoded-file' to mert-moses.pl
> The script assumes that a caret token '^' has been added between words
> while preprocessing the corpora. So, all subwords between two carets are
> merged to create a single word. The changes are on line 826--834.
>
> Regards,
> Anoop.
>
>
>
> On Sat, Apr 22, 2017 at 5:10 PM, Jorg Tiedemann <tiedeman@gmail.com>
> wrote:
>
>> Hi,
>>
>> Is there an easy way to integrate a small script to process n-best lists
>> in mert-moses.perl before running mert at each iteration? An example would
>> be to merge character-level translations to run mert on word-level
>> segmentations. It?s probably rather straightforward to add an option to
>> specify a script for filtering but it may already exist and I just don?t
>> see it?
>>
>> Thanks!
>> J?rg
>>
>> ************************************************************
>> **********************************
>> J?rg Tiedemann
>> Department of Modern Languages http://blogs.helsinki.fi/tiedeman/
>> University of Helsinki
>> http://blogs.helsinki.fi/language-technology/
>>
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
>
> --
> I claim to be a simple individual liable to err like any other fellow
> mortal. I own, however, that I have humility enough to confess my errors
> and to retrace my steps.
>
> http://flightsofthought.blogspot.com
> <mert-moses.pl>_______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170422/7d8adbc7/attachment-0001.html

------------------------------

Message: 2
Date: Sun, 23 Apr 2017 00:27:42 +0300
From: Ergun Bicici <bicici@gmail.com>
Subject: Re: [Moses-support] MERT: process n-best list before mert
To: Hieu Hoang <hieuhoang@gmail.com>
Cc: Jorg Tiedemann <tiedeman@gmail.com>, moses-support
<moses-support@mit.edu>
Message-ID:
<CAB59qTP_7onP4rmPc1Z6qHFLkQEunvK5t6Tr7H04K2F8dHSQpQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Dear Jorg,

I encountered similar issue for my character SMT experiments concurrently
looking at your paper (
http://aclanthology.info/papers/combining-word-level-and-character-level-models-for-machine-translation-between-closely-related-languages)
and added a line to call a script before the gzip of the nbest file within
the MERT main loop.

At the end of section 2, you mention that:
"
[image: Inline image 1]
" Preslav Nakov <http://aclanthology.info/people/preslav-nakov-5181> | J?rg
Tiedemann <http://aclanthology.info/people/jorg-tiedemann-3696>
*Anthology:*P12-2059 Volume:Proceedings of the 50th Annual Meeting of the
Association for Computational Linguistics (Volume 2: Short Papers)
<http://aclanthology.info/volumes/proceedings-of-the-50th-annual-meeting-of-the-association-for-computational-linguistics-volume-2-short-papers>
*Authors:*Preslav Nakov
<http://aclanthology.info/people/preslav-nakov-5181> | J?rg Tiedemann
<http://aclanthology.info/people/jorg-tiedemann-3696> *Month:*July *Year:*
2012 *Venue:*ACL <http://aclanthology.info/venues/acl> *Address:*Jeju
Island, Korea *SIG:* *Publisher:*Association for Computational Linguistics
*Pages:*301?305 *URL:*http://aclweb.org/anthology/P12-2059

Regards,
Ergun

On Sat, Apr 22, 2017 at 8:41 PM, Hieu Hoang <hieuhoang@gmail.com> wrote:

> sounds good. You're always welcome to check it in yourself as long as you
> look after it. Send me your github username or create a git pull request
>
> * Looking for MT/NLP opportunities *
> Hieu Hoang
> http://moses-smt.org/
>
>
> On 22 April 2017 at 15:44, Jorg Tiedemann <tiedeman@gmail.com> wrote:
>
>>
>> Great - thanks. I modified a little to make it possible to pass a command
>> to the new option.
>> I hope I didn?t break anything. Maybe this modification could become part
>> of the official moses package?
>>
>> J?rg
>>
>> J?rg Tiedemann
>> tiedeman@gmail.com
>>
>>
>>
>>
>>
>>
>>
>> On 22 Apr 2017, at 15:50, Anoop (?????) <anoop.kunchukuttan@gmail.com>
>> wrote:
>>
>> Hi Jorg,
>>
>> I had made changes to mert-moses.perl to achieve exactly what you are
>> looking for. Please find the script attached.
>>
>> To enable character level to word level transformation, you have to pass
>> the option '--transform-decoded-file' to mert-moses.pl
>> The script assumes that a caret token '^' has been added between words
>> while preprocessing the corpora. So, all subwords between two carets are
>> merged to create a single word. The changes are on line 826--834.
>>
>> Regards,
>> Anoop.
>>
>>
>>
>> On Sat, Apr 22, 2017 at 5:10 PM, Jorg Tiedemann <tiedeman@gmail.com>
>> wrote:
>>
>>> Hi,
>>>
>>> Is there an easy way to integrate a small script to process n-best lists
>>> in mert-moses.perl before running mert at each iteration? An example would
>>> be to merge character-level translations to run mert on word-level
>>> segmentations. It?s probably rather straightforward to add an option to
>>> specify a script for filtering but it may already exist and I just don?t
>>> see it?
>>>
>>> Thanks!
>>> J?rg
>>>
>>> ************************************************************
>>> **********************************
>>> J?rg Tiedemann
>>> Department of Modern Languages http://blogs.helsinki.fi/tiedeman/
>>> University of Helsinki
>>> http://blogs.helsinki.fi/language-technology/
>>>
>>>
>>>
>>> _______________________________________________
>>> Moses-support mailing list
>>> Moses-support@mit.edu
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>
>>>
>>
>>
>> --
>> I claim to be a simple individual liable to err like any other fellow
>> mortal. I own, however, that I have humility enough to confess my errors
>> and to retrace my steps.
>>
>> http://flightsofthought.blogspot.com
>> <mert-moses.pl>_______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>


--

Regards,
Ergun
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170422/67ba4a07/attachment.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image.png
Type: image/png
Size: 24729 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20170422/67ba4a07/attachment.png

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 126, Issue 30
**********************************************

0 Response to "Moses-support Digest, Vol 126, Issue 30"

Post a Comment