Moses-support Digest, Vol 108, Issue 85

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. mismatch in alignment file when using -alignment-output-file
(Arefeh Kazemi)
2. Re: At least fileset to keep lm, tm and tuning result (Masa Taka)


----------------------------------------------------------------------

Message: 1
Date: Sat, 31 Oct 2015 14:00:41 +0330
From: Arefeh Kazemi <kazemina@dcu.ie>
Subject: [Moses-support] mismatch in alignment file when using
-alignment-output-file
To: moses-support@mit.edu
Message-ID:
<CAB2b+QUmtzP=2Efp_oDqCF0sV1sTru8wPTUr-UQ0vo4ehQqpWQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi

I needed the word alignment between the source and the output translation
and I used -alignment-output-file parameter. It gives me an alignment file
but there are some mismatches between the source sentences' length and the
alignment so that the highest index in the alignment is greater than the
sentence length.
for example, for the source sentence
"darya alexandrovna went alone to her room ."
the alignment is :
0-0 1-1 2-1 3-6 4-3 5-2 6-5 7-4 8-7 9-8

I checked the sentences but there is no strange string in them.

Does anyone know why this happens?!

Regards
Arefeh

--


*Email Disclaimer"This e-mail and any files transmitted with it are confidential and are intended solely for use by the addressee. Any unauthorised dissemination, distribution or copying of this message and any attachments is strictly prohibited. If you have received this e-mail in error, please notify the sender and delete the message. Any views or opinions presented in this e-mail may solely be the views of the author and cannot be relied upon as being those of Dublin City University. E-mail communications such as this cannot be guaranteed to be virus-free, timely, secure or error-free and Dublin City University does not accept liability for any such matters or their consequences. Please consider the environment before printing this e-mail."S?anadh R?omhphoist"T? an r?omhphost seo agus aon chomhad a sheoltar leis faoi r?n agus is lena ?s?id ag an seola? agus sin amh?in ?. T? cosc ioml?n ar scaipeadh, dh?ileadh n? ch?ipe?il neamh?daraithe ar an teachtaireacht seo agus ar ao!
n cheangalt?n at? ag dul leis. M? t? an r?omhphost seo faighte agat tr? dhearmad cuir sin in i?l le do thoil don seolt?ir agus scrios an teachtaireacht. D?fh?adfadh s? gurb iad tuairim? an ?dair agus sin amh?in at? in aon tuairim? no dearctha? at? curtha i l?thair sa r?omhphost seo agus n?or ch?ir glacadh leo mar thuairim? n? dhearctha? Ollscoil Chathair Bhaile ?tha Cliath. N? ghlactar leis go bhfuil cumars?id r?omhphoist den s?rt seo saor ? v?reas, in am, sl?n, n? saor ? earr?id agus n? ghlacann Ollscoil Chathair Bhaile ?tha Cliath le dliteanas in aon ch?s den s?rt sin n? as aon iarmhairt a d?eascr?dh astu. Cuimhnigh ar an timpeallacht le do thoil sula gcuireann t? an r?omhphost seo i gcl?."*

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20151031/93d27eaf/attachment-0001.html

------------------------------

Message: 2
Date: Sat, 31 Oct 2015 12:26:56 +0000
From: Masa Taka <grantaka36@gmail.com>
Subject: Re: [Moses-support] At least fileset to keep lm, tm and
tuning result
To: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<CAEJVhXfA8isii6Xc1NgCchUN=h1dPkq1uTrSD8y7ksYqqOcrhQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi Philipp,

I see clearly that, and will check and use files with moses.ini. Thanks.

Masataka



2015?10?30?(?) 14:20 Philipp Koehn <phi@jhu.edu>:

> Hi,
>

> to run a system, you will only need the moses.ini with the tuned weights
> and all the files that are referenced in there. This is minimally the
> language model and the phrase table, but may also include the reordering
> table and other fancy feature functions that have models (like OSM).
>
> -phi
>

> On Thu, Oct 29, 2015 at 9:24 PM, Masa Taka <grantaka36@gmail.com> wrote:
>
>> Question:
>> What files should I keep "at least" as the language model, translation
>> model and tuning result?
>>
>> For example, on English(f) to Japanese(e) translation, should the
>> following files be enough?
>> - model/ja/lm.ja.gz
>> - model/en-ja/phrase-table.gz
>> - tuning/en-ja/moses.ini
>>
>> Detail:
>> When performing Moses SMT by default commands on the same PC, the
>> directories model/ and tuning/ are overwritten. I would like to keep the
>> minimum fileset to compare different training conditions (-o etc.) and
>> translation conditions (-dl, -s etc.)
>>
>> I know it is preferable all files should be stored, but for this time, it
>> is enough to have the same output 'e.txt' when inputting the same 'f.txt.'
>>
>> moses -f moses.ini < f.txt 1> e.txt 2> output.err
>>
>> Regards,
>> Masataka
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20151031/09666e90/attachment-0001.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 108, Issue 85
**********************************************

0 Response to "Moses-support Digest, Vol 108, Issue 85"

Post a Comment