Moses-support Digest, Vol 109, Issue 62

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."

Today's Topics:

1. Moses + MPI (Benson Muite)
2. Re: Adding feature to moses (Hieu Hoang)
3. Re: different versions of moses yielding different
translations (Vito Mandorino)

----------------------------------------------------------------------

Message: 1
Date: Wed, 25 Nov 2015 14:28:48 +0200
From: Benson Muite <benson.muite@ut.ee>
Subject: [Moses-support] Moses + MPI
To: moses-support@mit.edu
Message-ID: <5655A980.7000908@ut.ee>
Content-Type: text/plain; charset=utf-8

Hi,

IS there any information on performance improvement that can be obtained
using Moses + MPI? Source code seems to have some MPI available for
training using MIRA:

https://github.com/moses-smt/mosesdecoder/blob/82527fc8b247ed34947c33ba808a92a1c07b054b/contrib/mira/Main.cpp

It would be helpful to know how useful this is if anyone has tried it.

Thanks,
Benson

------------------------------

Message: 2
Date: Wed, 25 Nov 2015 13:16:25 +0000
From: Hieu Hoang <hieuhoang@gmail.com>
Subject: Re: [Moses-support] Adding feature to moses
To: haouarid@iro.umontreal.ca, moses-support@mit.edu
Message-ID: <5655B4A9.7000602@gmail.com>
Content-Type: text/plain; charset=windows-1252; format=flowed

I'm not sure what you're trying to do. Can you give an example.

On 24/11/2015 22:46, haouarid@iro.umontreal.ca wrote:
> Hi,
>
> I would like to add a word by word translation model to moses as a new
> feature. I didn't understand how it is possible to have access to the
> source and target words. As it is explained in the documentation,
> StringPiece is accessible by Factor class which is accessible by Word
> class which is accessible by TargetPhrase which is given as argument in
> the evaluation method like EvaluateWithSourceContext. I don't use factors
> with my translation model, I just use surface form of words.
> How is it possible to read the string source and target words without
> using Factors?
>
> Thank you,
> Dorra
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>

--
Hieu Hoang
http://www.hoang.co.uk/hieu

------------------------------

Message: 3
Date: Wed, 25 Nov 2015 14:53:07 +0100
From: Vito Mandorino <vito.mandorino@linguacustodia.com>
Subject: Re: [Moses-support] different versions of moses yielding
different translations
To: Hieu Hoang <hieuhoang@gmail.com>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CA+8mSmFKnt-YKw4uHt1AB+w-=osyq3eF7PWhf5kaM1ggytevmA@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Thank you. In our tests it seems that with the OnDisk table the quality is
basically the same between the two versions of Moses (average 0.2
difference in score Bleu) but for the CompactPhraseTable the difference is
larger (2 points Bleu loss in average after re-tuning with the new version
of Moses, and more than 5 points Bleu in average without re-tuning).
Do you think a better quality would be obtained by running a complete
re-training of the model with the new version of Moses?

Best regards,
Vito

2015-11-24 16:31 GMT+01:00 Hieu Hoang <hieuhoang@gmail.com>:

> There was a change in the underlying datastructure for stacks, it changed
> from std::set (ordered) to boost::unordered_set.
>
> https://github.com/moses-smt/mosesdecoder/commit/6b182ee5e987a5b2823aea7eaaa7ef0457c6a30d
> This got some speed gains
>
> 1 5 10 15 20 25 30 35
> 56 real 4m57.795s real 1m19.005s real 0m51.636s real 0m49.624s
> real 0m49.869s real 0m52.475s real 0m53.806s real 0m54.957s 13/10
> baseline user 4m41.255s user 5m45.086s user 6m34.053s user
> 8m12.430s user 8m10.667s user 8m16.486s user 8m10.592s user
> 8m13.859s
> sys 0m16.514s sys 0m35.494s sys 0m54.513s sys 1m10.643s
> sys 1m18.449s sys 1m21.738s sys 1m23.133s sys 1m25.048s
>
>
>
>
>
>
>
>
>
> 57 real 4m41.148s real 1m16.002s real 0m50.747s real 0m48.711s
> real 0m49.130s real 0m51.473s real 0m53.141s real 0m54.513s (56)
> + unordered set stack user 4m23.968s user 5m30.356s user
> 6m26.167s user 7m39.286s user 7m56.229s user 7m52.669s user
> 7m56.978s user 7m56.216s
> sys 0m17.231s sys 0m35.063s sys 0m54.081s sys 1m10.137s
> sys 1m17.194s sys 1m22.912s sys 1m25.948s sys 1m26.247s
> However, the hypotheses are now added to the stack in a different order so
> there will be slight differences in results
>
>
> On 24/11/2015 13:53, Vito Mandorino wrote:
>
> Hi,
>
> in some of our tests a recent version of Moses (pulled from github last
> week) and an older one do not give the same translations on the same source
> segment (with the same moses.ini).
> Here is the 5-best list for the translation of 'test' with the last week
> version:
>
> 0 ||| test ||| LexicalReordering0= -1.1969 0 0 0 0 0 Distortion0= 0 LM0=
> -51.1788 WordPenalty0= -1 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -3.03811 -2.5834 -2.08503 -1.83075 ||| -1.27754
> 0 ||| testing ||| LexicalReordering0= 0 0 0 0 0 0 Distortion0= 0 LM0=
> -35.1495 WordPenalty0= -1 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -5.21045 -5.04877 -4.71131 -4.66382 ||| -1.70337
> 0 ||| funds ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0 LM0=
> -11.3753 WordPenalty0= -1 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -10.8209 -10.6835 -5.14555 -5.73388 ||| -1.77009
> 0 ||| known as a ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0
> LM0= -58.8877 WordPenalty0= -3 PhrasePenalty0= 1
> PhraseDictionaryMultiModel0= -4.42285 -11.9339 -5.14555 -18.0392 |||
> -1.89152
> 0 ||| as a ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0 LM0=
> -35.5353 WordPenalty0= -2 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -9.34698 -11.9339 -5.14555 -9.14874 ||| -1.89159
>
> and with the older version of Moses:
>
> 0 ||| funds ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0 LM0=
> -11.3753 WordPenalty0= -1 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -2.52548 -2.52544 -2.45544 -2.48609 ||| -0.815668
> 0 ||| as a ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0 LM0=
> -35.5353 WordPenalty0= -2 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -2.52464 -2.52565 -2.45544 -2.5244 ||| -0.953799
> 0 ||| as ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0 LM0=
> -34.1633 WordPenalty0= -1 PhrasePenalty0= 1 PhraseDictionaryMultiModel0=
> -2.5256 -2.52565 -2.45544 -2.48609 ||| -1.07254
> 0 ||| known as a ||| LexicalReordering0= -3.1355 0 0 0 0 0 Distortion0= 0
> LM0= -58.8877 WordPenalty0= -3 PhrasePenalty0= 1
> PhraseDictionaryMultiModel0= -2.38597 -2.52565 -2.45544 -2.52573 |||
> -1.07536
> 0 ||| is known as a ||| LexicalReordering0= -3.1355 0 0 0 0 0
> Distortion0= 0 LM0= -80.8518 WordPenalty0= -4 PhrasePenalty0= 1
> PhraseDictionaryMultiModel0= -2.37158 -2.52565 -2.45544 -2.52573 |||
> -1.18753
>
> This looks very strange. The only difference is in the phrase-table
> scores. Do you have any idea of what is going on? The only possibility
> which come to mind is maybe a different handling of the
> PhraseDictionaryMultiModel feature.
> The moses.ini is in attachment.
>
> Best regards,
>
> Vito
>
>
>
> --
> *M**. Vito MANDORINO -- Chief Scientist*
>
>
> [image: Description : Description : lingua_custodia_final full logo]
>
> *The Translation Trustee*
>
> *1, Place Charles de Gaulle, **78180 Montigny-le-Bretonneux*
>
> *Tel : +33 1 30 44 04 23 Mobile : +33 6 84 65 68 89
> <%2B33%206%2084%2065%2068%2089>*
>
> *Email :* *
> <massinissa.ahmim@linguacustodia.com>vito.mandorino@linguacustodia.com
> <vito.mandorino@linguacustodia.com>*
>
> *Website :* *www.linguacustodia.com <http://www.linguacustodia.com/> -
> www.thetranslationtrustee.com <http://www.thetranslationtrustee.com/>*
>
>
> _______________________________________________
> Moses-support mailing listMoses-support@mit.eduhttp://mailman.mit.edu/mailman/listinfo/moses-support
>
>
> --
> Hieu Hoanghttp://www.hoang.co.uk/hieu
>
>

--
*M**. Vito MANDORINO -- Chief Scientist*

[image: Description : Description : lingua_custodia_final full logo]

*The Translation Trustee*

*1, Place Charles de Gaulle, **78180 Montigny-le-Bretonneux*

*Tel : +33 1 30 44 04 23 Mobile : +33 6 84 65 68 89
<%2B33%206%2084%2065%2068%2089>*

*Email :* *vito.mandorino@linguacustodia.com
<massinissa.ahmim@linguacustodia.com>*

*Website :* *www.linguacustodia.com <http://www.linguacustodia.com/> -
www.thetranslationtrustee.com <http://www.thetranslationtrustee.com/>*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20151125/d4ee1295/attachment.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/jpeg
Size: 4421 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20151125/d4ee1295/attachment.jpe
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.jpg
Type: image/jpeg
Size: 4421 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20151125/d4ee1295/attachment.jpg

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

End of Moses-support Digest, Vol 109, Issue 62
**********************************************

Moses-support Digest, Vol 109, Issue 62

0 Response to "Moses-support Digest, Vol 109, Issue 62"

Post a Comment