Moses-support Digest, Vol 99, Issue 53

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: Starting words are left untranslated (Tom Hoar)
2. Re: Missing processPhraseTable binary from bin directory
(Philipp Koehn)
3. Third and Last Call for Papers: 20th International Conference
on Applications of Natural Language to Information Systems
(NLDB'15) (Michael Zock)


----------------------------------------------------------------------

Message: 1
Date: Fri, 23 Jan 2015 09:36:54 +0700
From: Tom Hoar <tahoar@precisiontranslationtools.com>
Subject: Re: [Moses-support] Starting words are left untranslated
To: moses-support@mit.edu
Message-ID: <54C1B3C6.6040506@precisiontranslationtools.com>
Content-Type: text/plain; charset="windows-1252"

Re untranslated words, make sure when translating "another documents"
they are prepared the same as your parallel training corpus' source
language. This includes the upper/lower case and newline characters Hieu
mentioned. Also, make sure your target half and your language model are
also prepared the same.

Re the mismatch, fix the above first problem then retest this issue. If
it persists and because the mismatch is in the reordering table, think
about lowering the "distortion-limit" in your moses.ini file and retune
your engine with the new value.



On 01/23/2015 12:24 AM, Hieu Hoang wrote:
> is it 'cos it's uppercased?
>
> are you sure your input test set doesn't have the Windows return
> character (0x13)? Get rid of it by doing dos2unix
>
> On 22/01/15 17:11, CHATZITHEODOROU Konstantinos wrote:
>>
>> Hello Moses,
>>
>> I have trained an EN-IT phrase based MT Engine with a very high BLEU
>> score. I have also checked the test output and the MT quality is
>> pretty good.
>>
>> Now, I am trying to translate another document, different from the
>> test set but I realised that a large number of words are left
>> untranslated even if in the phrase translation table these words
>> appear having their own entries (phrase length =1). Moreover, I
>> observed that most of the times, the first word of each sentence is
>> untranslated. I run moses decoder with -output-unknowns option.
>>
>> Here are some examples:
>>
>>
>> Think and respond to the questions . - > Think e rispondere alle
>> domande .
>>
>> I think . -> Io penso .
>>
>>
>> One more issue. The word "real" is translated as "rear" (posteriore)
>> in Italian. Again, I checked the phrase translation table entries and
>> there is no matching between these two words. The only match is in
>> the reordering table.
>>
>>
>> Do you have any idea what is going wrong and what I can do?
>>
>>
>> Thank you for your time.
>>
>>
>> Best,
>>
>> Konstantinos
>>
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150123/1cd90a00/attachment-0001.htm

------------------------------

Message: 2
Date: Thu, 22 Jan 2015 22:07:55 -0500
From: Philipp Koehn <phi@jhu.edu>
Subject: Re: [Moses-support] Missing processPhraseTable binary from
bin directory
To: Mario Diana <mdiana@technicallycreative.com>
Cc: "moses-support@mit.edu" <moses-support@mit.edu>, Hieu Hoang
<hieu.hoang@ed.ac.uk>
Message-ID:
<CAAFADDCT1EuRwVbJMoEDOppzS705ttw3mckTEFyEgPxN6PZ-7w@mail.gmail.com>
Content-Type: text/plain; charset=UTF-8

Hi,

I fixed the tutorial.

-phi

On Thu, Jan 22, 2015 at 5:07 PM, Mario Diana
<mdiana@technicallycreative.com> wrote:
> Thank you, Hieu. I read that, but couldn?t quite tell if that was my problem (as the old utility is still mentioned in the tutorial). If it isn?t too much trouble could you explain how I might get my hands on one of the 4 alternatives? I?m quite new to all of this. Thank you.
>
> From: hieuhoang@gmail.com [mailto:hieuhoang@gmail.com] On Behalf Of Hieu Hoang
> Sent: Thursday, January 22, 2015 4:58 PM
> To: Mario Diana
> Cc: moses-support@mit.edu
> Subject: Re: [Moses-support] Missing processPhraseTable binary from bin directory
>
> fyi
> https://www.mail-archive.com/moses-support@mit.edu/msg11367.html
>
>
> On 22 January 2015 at 21:55, Mario Diana <mdiana@technicallycreative.com> wrote:
> Hello,
>
> [...]
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support



------------------------------

Message: 3
Date: Fri, 23 Jan 2015 12:56:59 +0100
From: Michael Zock <Michael.Zock@lif.univ-mrs.fr>
Subject: [Moses-support] Third and Last Call for Papers: 20th
International Conference on Applications of Natural Language to
Information Systems (NLDB'15)
To: corpora@uib.no, moses-support@mit.edu
Message-ID: <54C2370B.5000302@lif.univ-mrs.fr>
Content-Type: text/plain; charset="us-ascii"

An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150123/b84f40f7/attachment.htm

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 99, Issue 53
*********************************************

0 Response to "Moses-support Digest, Vol 99, Issue 53"

Post a Comment