Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Number of sentences (tokens) for tuning and testing
(Davood Mohammadifar)
2. truecase.perl (Vincent Nguyen)
3. Re: truecase.perl (Vincent Nguyen)
----------------------------------------------------------------------
Message: 1
Date: Sat, 26 Sep 2015 11:37:29 +0000
From: Davood Mohammadifar <davood_mf@hotmail.com>
Subject: [Moses-support] Number of sentences (tokens) for tuning and
testing
To: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID: <SNT150-W619CA92E749D08102FDA3E8C410@phx.gbl>
Content-Type: text/plain; charset="iso-8859-1"
Hello everyone
For testing and tuning in Persian to English statistical machine translation with Moses, how many sentences (tokens) should the Dev and test contain? what is your recommendation?
Both Persian and English sides have about 6 millions tokens (240K sentences).
Thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20150926/416b5c82/attachment-0001.html
------------------------------
Message: 2
Date: Sat, 26 Sep 2015 16:37:16 +0200
From: Vincent Nguyen <vnguyen@neuf.fr>
Subject: [Moses-support] truecase.perl
To: moses-support <moses-support@mit.edu>
Message-ID: <5606AD9C.40801@neuf.fr>
Content-Type: text/plain; charset=utf-8; format=flowed
Hello,
Quick question regarding this script behavior.
<g id="1">Les Banques de la zone Euro sont soumises ? :</g>
becomes
<g id="1"> les banques de la zone euro sont soumises ? :</g>
lowercasing is fine
the space between >Les is fine
but it did not insert a space between the after the : in :</g>
any clue ?
Vincent
------------------------------
Message: 3
Date: Sat, 26 Sep 2015 16:50:27 +0200
From: Vincent Nguyen <vnguyen@neuf.fr>
Subject: Re: [Moses-support] truecase.perl
To: moses-support <moses-support@mit.edu>
Message-ID: <5606B0B3.40204@neuf.fr>
Content-Type: text/plain; charset=utf-8; format=flowed
actually after > space is always inserted, but before < never inserted.
Le 26/09/2015 16:37, Vincent Nguyen a ?crit :
> Hello,
>
> Quick question regarding this script behavior.
>
> <g id="1">Les Banques de la zone Euro sont soumises ? :</g>
>
> becomes
>
> <g id="1"> les banques de la zone euro sont soumises ? :</g>
>
> lowercasing is fine
> the space between >Les is fine
> but it did not insert a space between the after the : in :</g>
>
> any clue ?
>
> Vincent
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 107, Issue 61
**********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 107, Issue 61"
Post a Comment