Moses-support Digest, Vol 100, Issue 72

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Fwd: SGM files (Ricardo Cabello Sánchez)
2. Re: Fwd: SGM files (Barry Haddow)
3. SRI/KenLM kneser ney implementation (koormoosh)
4. Fwd: Fwd: SGM files (Ricardo Cabello Sánchez)
5. Fwd: Fwd: SGM files (Ricardo Cabello Sánchez)


----------------------------------------------------------------------

Message: 1
Date: Sun, 22 Feb 2015 11:44:51 +0100
From: Ricardo Cabello Sánchez
<ricardo.cabello.sanchez@googlemail.com>
Subject: [Moses-support] Fwd: SGM files
To: moses-support@mit.edu
Message-ID:
<CAJxWzkYHs2hcts__DBPq9WUJyP2SNPG830=AvxVmZTKZaHA+tg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi all,

I do not know if my message reached you.

Thank you

Ricardo

---------- Forwarded message ----------
From: Ricardo Cabello Sánchez <ricardo.cabello.sanchez@googlemail.com>
Date: 2015-02-21 19:51 GMT+01:00
Subject: SGM files
To: moses-support@mit.edu


Hi,

I would like to ask how I can get sgm files for the reference. I already
have correctly aligned txt files in ENG and PTG. Is there any way to run
Moses without these sgm files?

Thank you and best regards,

Ricardo

------------------------------

Message: 2
Date: Sun, 22 Feb 2015 10:55:00 +0000
From: Barry Haddow <bhaddow@staffmail.ed.ac.uk>
Subject: Re: [Moses-support] Fwd: SGM files
To: Ricardo Cabello Sánchez <ricardo.cabello.sanchez@googlemail.com>
Cc: moses-support@mit.edu
Message-ID: <20150222105500.17032j4jyzvc8bs4@www.staffmail.ed.ac.uk>
Content-Type: text/plain; charset=ISO-8859-1; DelSp="Yes";
format="flowed"

Hi Ricardo

You only need sgm files if you want to evaluate with the NIST BLEU
script. If you want to use BLEU to evaluate with txt files, just use
multi-bleu.perl (in scripts/generic).

cheers - Barry
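
For cases where the NIST script is still wanted, a plain-text reference can
be wrapped into SGM with a few lines of code. The following is a minimal
sketch, not a Moses tool. It assumes one segment per line and the
refset/doc/seg layout commonly used with the NIST mteval scripts; the
function name and the setid, docid and language values are hypothetical
placeholders, so compare the output against a known-good sgm file before
relying on it.

```python
from xml.sax.saxutils import escape


def wrap_reference_as_sgm(lines, setid, srclang, trglang,
                          docid="doc1", sysid="ref"):
    """Wrap plain-text reference lines (one segment per line) in SGM markup.

    All identifiers passed in are placeholders chosen by the caller; the
    layout is an assumption based on the usual NIST mteval reference format.
    """
    out = [
        f'<refset setid="{setid}" srclang="{srclang}" trglang="{trglang}">',
        f'<doc docid="{docid}" sysid="{sysid}">',
    ]
    for seg_id, line in enumerate(lines, start=1):
        # Escape &, < and > so the segment text cannot break the markup.
        out.append(f'<seg id="{seg_id}">{escape(line.strip())}</seg>')
    out.append("</doc>")
    out.append("</refset>")
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    # Hypothetical two-line reference; real use would read the txt file.
    print(wrap_reference_as_sgm(["first segment", "second segment"],
                                setid="example_set",
                                srclang="src", trglang="trg"), end="")
```

The segment ids are simply the line numbers, so the source, reference and
system output files must all keep the same line order.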




--
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.




------------------------------

Message: 3
Date: Sun, 22 Feb 2015 22:28:22 +1100
From: koormoosh <koormoosh@gmail.com>
Subject: [Moses-support] SRI/KenLM kneser ney implementation
To: moses-support@mit.edu
Message-ID:
<CAN3_CDiVY-NnzYGCyRnqQEeYKCOWOHV8p-42aH=sWwWzX9_4Ww@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi,

I wonder whether SRI does any sort of implicit pruning or refinement. To be more
precise, is there any way to force SRI not to prune anything (removing
singletons, etc.)? I thought that the way I call it does what I want (no
pruning), but then I cannot explain why I get different results.
This is how I call SRI:

-----------------------------------------------------------------------------------------------------
./ngram-count -order 3 -text training.txt -write training.ngrams

./ngram-count -order 3 -read training.ngrams -lm training.binary \
    -interpolate -ukndiscount -gt1min 0 -gt2min 0 -gt3min 0 -write-binary-lm

./ngram -order 3 -lm training.binary -ppl test.txt -debug 2

Am I missing or misusing something?

--------------------------------------------------------------------------------------------------------
An example to show this problem:
(Example-1):
Test: "13 13 13"
Training: "13 13 13 13 17"
perplexity *matches* SRI: "2.79327"

(Example-2):
Test: "13 13 13"
Training: "13 13 13 13 13 13 17 17 17 17 17 14 14 15 15 15 16 16 16 16"
perplexity *doesn't match* SRI: "4.51546", whereas SRI returns "4.242".
-------------------------------------------------------------------------------------------------------

Thanks in advance,
Koorm
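
When perplexities from two implementations are compared, one thing that may
be worth ruling out first is the normalisation. SRILM's ngram -ppl reports
two figures: ppl, which divides the log10 probability by words minus OOVs
plus sentences (so each end-of-sentence token counts), and ppl1, which
divides by words minus OOVs only. The sketch below just restates those two
formulas; the function name and the log probability in the usage lines are
hypothetical, and nothing here shows that normalisation is the cause of the
mismatch in Example-2.

```python
def srilm_style_perplexities(logprob10, words, sentences, oovs=0):
    """Return (ppl, ppl1) from a total log10 probability.

    ppl  counts one end-of-sentence token per sentence;
    ppl1 counts only the in-vocabulary words.
    """
    ppl = 10 ** (-logprob10 / (words - oovs + sentences))
    ppl1 = 10 ** (-logprob10 / (words - oovs))
    return ppl, ppl1


if __name__ == "__main__":
    # Test sentence "13 13 13": 3 words, 1 sentence.
    # The log probability -2.0 is a made-up value for illustration only.
    ppl, ppl1 = srilm_style_perplexities(-2.0, words=3, sentences=1)
    print(f"ppl={ppl:.5f} ppl1={ppl1:.5f}")
```

If the other implementation uses a different token count in the
denominator, the two numbers will differ even when the underlying
probabilities agree.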

------------------------------

Message: 4
Date: Sun, 22 Feb 2015 12:50:24 +0100
From: Ricardo Cabello Sánchez
<ricardo.cabello.sanchez@googlemail.com>
Subject: [Moses-support] Fwd: Fwd: SGM files
To: moses-support@mit.edu, Barry Haddow <bhaddow@staffmail.ed.ac.uk>
Message-ID:
<CAJxWzkZ87oaJMcSj7sSzs1sxaCCrmBfFqWszhk9KqaJ5G5V_Uw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi Barry,

Thank you very much for your quick response. It helped a lot. Now I can
train my system much better. I do not have much expertise in scripts and
config files, but I am doing my best. Do you know where I can find a quick
guide on how to set up the config file?

The experiment crashed at the test filter step. Could you please advise? Please
find the log file attached.

Thank you and best regards,

Ricardo

-------------- next part --------------
A non-text attachment was scrubbed...
Name: EVALUATION_test_filter.10.STDERR
Type: application/octet-stream
Size: 1232 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20150222/1bb56bed/attachment-0001.obj

------------------------------

Message: 5
Date: Sun, 22 Feb 2015 12:52:34 +0100
From: Ricardo Cabello Sánchez
<ricardo.cabello.sanchez@googlemail.com>
Subject: [Moses-support] Fwd: Fwd: SGM files
To: Barry Haddow <bhaddow@staffmail.ed.ac.uk>, moses-support@mit.edu
Message-ID:
<CAJxWzkZ59jseNEB+f-_6MhmxBn2YDxCR5f1OVGkp0meQDPTk-Q@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi,

I also get problems at:
step EVALUATION:test:reference-from-sgm crashed
step EVALUATION:test:analysis-coverage crashed

Could you please advise? Please find the log files attached.

Thanks

Ricardo

-------------- next part --------------
A non-text attachment was scrubbed...
Name: EVALUATION_test_analysis-coverage.13.STDERR
Type: application/octet-stream
Size: 169 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20150222/ae80b7c7/attachment.obj
-------------- next part --------------
A non-text attachment was scrubbed...
Name: EVALUATION_test_filter.13.STDERR
Type: application/octet-stream
Size: 1232 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20150222/ae80b7c7/attachment-0001.obj
-------------- next part --------------
A non-text attachment was scrubbed...
Name: EVALUATION_test_reference-from-sgm.13.STDERR
Type: application/octet-stream
Size: 131 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20150222/ae80b7c7/attachment-0002.obj

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 100, Issue 72
**********************************************
