Moses-support Digest, Vol 119, Issue 32

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: accessing the compact format of the phrase-table
(Dimitar Shterionov)
2. accessing the compact format of the phrase-table (Tom Hoar)
3. Re: accessing the compact format of the phrase-table
(Dimitar Shterionov)


----------------------------------------------------------------------

Message: 1
Date: Fri, 23 Sep 2016 10:32:36 +0100
From: Dimitar Shterionov <dimitars@kantanmt.com>
Subject: Re: [Moses-support] accessing the compact format of the
phrase-table
To: Marcin Junczys-Dowmunt <junczys@amu.edu.pl>
Cc: moses-support@mit.edu
Message-ID:
<CALi0KgGXu7Zdintcn2manP+y1jeMubBxjWNtb2s24-BGBc=jXQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Thanks a lot Marcin.

I was fearing that would be the case :).

Cheers,
Dimitar.

Dimitar Shterionov | dimitars@kantanmt.com | Machine Translation Researcher

www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
Software. No Hardware. No Hassle MT.

<https://www.facebook.com/KantanMT>
<https://plus.google.com/+Kantanmt_cloudmachinetranslation>
<https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
<http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
<http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>
<info@kantanmt.com>


On 23 September 2016 at 10:26, Marcin Junczys-Dowmunt <junczys@amu.edu.pl>
wrote:

> Hi Dimitar,
> full string unfortunately. This is a hash table, so no partial methods
> possible. The source string itself isn't stored anywhere.
> Best,
> Marcin
>
> W dniu 23/09/16 o 10:24, Dimitar Shterionov pisze:
>
> Hi Marcin and thanks for the quick reply.
>
> With the queryPhraseTableMin do I always need to provide exact string to
> match an entry or can I actually provide partial string? or regex?
> Basically I want to extract all 1-grams. Any suggestions?
>
> Thanks once again.
> Dimitar.
>
> Dimitar Shterionov | dimitars@kantanmt.com | Machine Translation
> Researcher
>
> www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
> Software. No Hardware. No Hassle MT.
>
> <https://www.facebook.com/KantanMT>
> <https://plus.google.com/+Kantanmt_cloudmachinetranslation>
> <https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
> <http://www.slideshare.net/kantanmt>
> <https://www.youtube.com/user/KantanMT> <http://kantanmtblog.com/>
> <https://kantanmt.com/rssfeeds.php> <info@kantanmt.com>
>
>
> On 23 September 2016 at 10:19, Marcin Junczys-Dowmunt <junczys@amu.edu.pl>
> wrote:
>
>> Hi,
>> If you want a complete dump of the phrase table as text, this is not
>> possible. The compact phrase table is not reversible. You can use
>> queryPhraseTableMin to ask for the translations of specific phrases.
>> Best,
>> Marcin
>>
>> W dniu 23/09/16 o 10:16, Dimitar Shterionov pisze:
>>
>> Dear all,
>>
>> I want to extract some specific data from the table after the model has
>> been built. Before it is compacted I have it in raw format and it is easy.
>> But I don't know how to process and read the compacted format of the
>> phrase-table. Is there a way to read the phrase-table.minphr format as
>> text? Any suggestions?
>>
>> Thank you very much.
>> Kind regards,
>> Dimitar.
>>
>>
>> _______________________________________________
>> Moses-support mailing listMoses-support@mit.eduhttp://mailman.mit.edu/mailman/listinfo/moses-support
>>
>> _______________________________________________ Moses-support mailing
>> list Moses-support@mit.edu http://mailman.mit.edu/mailman
>> /listinfo/moses-support
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160923/b1126014/attachment-0001.html

------------------------------

Message: 2
Date: Fri, 23 Sep 2016 16:37:58 +0700
From: Tom Hoar <tahoar@pttools.net>
Subject: [Moses-support] accessing the compact format of the
phrase-table
To: moses-support@mit.edu
Message-ID: <1c7e478b-91c4-0fa4-342d-e6ee3f1ad28c@pttools.net>
Content-Type: text/plain; charset="windows-1252"

Dimitar, is there any reason you can't go back to the original text
phrase-table that was used to create the compact format?



On 9/23/2016 4:26 PM, moses-support-request@mit.edu wrote:
> Date: Fri, 23 Sep 2016 10:24:51 +0100
> From: Dimitar Shterionov<dimitars@kantanmt.com>
> Subject: Re: [Moses-support] accessing the compact format of the
> phrase-table
> To: Marcin Junczys-Dowmunt<junczys@amu.edu.pl>
> Cc:moses-support@mit.edu
>
> Hi Marcin and thanks for the quick reply.
>
> With the queryPhraseTableMin do I always need to provide exact string to
> match an entry or can I actually provide partial string? or regex?
> Basically I want to extract all 1-grams. Any suggestions?
>
> Thanks once again.
> Dimitar.
>
> Dimitar Shterionov |dimitars@kantanmt.com | Machine Translation Researcher
>
> www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
> Software. No Hardware. No Hassle MT.
>
> <https://www.facebook.com/KantanMT>
> <https://plus.google.com/+Kantanmt_cloudmachinetranslation>
> <https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
> <http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
> <http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>
> <info@kantanmt.com>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160923/e7677138/attachment-0001.html

------------------------------

Message: 3
Date: Fri, 23 Sep 2016 10:44:57 +0100
From: Dimitar Shterionov <dimitars@kantanmt.com>
Subject: Re: [Moses-support] accessing the compact format of the
phrase-table
To: Tom Hoar <tahoar@pttools.net>
Cc: moses-support@mit.edu
Message-ID:
<CALi0KgGPjTnH6-8mfN4L82DU-ThOReW2GHsOCPX1L0hXzQh4tQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hello Tom,

When I store the phrase-table for later use I store only the compact
version - it's simply smaller.

Is there another way to get the original text phrase-table without
rebuilding?

Cheers,
Dimitar.

Dimitar Shterionov | dimitars@kantanmt.com | Machine Translation Researcher

www.KantanMT.com <http://www.kantanmt.com/> | Easy Translation - No
Software. No Hardware. No Hassle MT.

<https://www.facebook.com/KantanMT>
<https://plus.google.com/+Kantanmt_cloudmachinetranslation>
<https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt>
<http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT>
<http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php>
<info@kantanmt.com>


On 23 September 2016 at 10:37, Tom Hoar <tahoar@pttools.net> wrote:

> Dimitar, is there any reason you can't go back to the original text
> phrase-table that was used to create the compact format?
>
>
> On 9/23/2016 4:26 PM, moses-support-request@mit.edu wrote:
>
> Date: Fri, 23 Sep 2016 10:24:51 +0100
> From: Dimitar Shterionov <dimitars@kantanmt.com> <dimitars@kantanmt.com>
> Subject: Re: [Moses-support] accessing the compact format of the
> phrase-table
> To: Marcin Junczys-Dowmunt <junczys@amu.edu.pl> <junczys@amu.edu.pl>
> Cc: moses-support@mit.edu
>
> Hi Marcin and thanks for the quick reply.
>
> With the queryPhraseTableMin do I always need to provide exact string to
> match an entry or can I actually provide partial string? or regex?
> Basically I want to extract all 1-grams. Any suggestions?
>
> Thanks once again.
> Dimitar.
>
> Dimitar Shterionov | dimitars@kantanmt.com | Machine Translation Researcher
> www.KantanMT.com <http://www.kantanmt.com/> <http://www.kantanmt.com/> | Easy Translation - No
> Software. No Hardware. No Hassle MT.
> <https://www.facebook.com/KantanMT> <https://www.facebook.com/KantanMT><https://plus.google.com/+Kantanmt_cloudmachinetranslation> <https://plus.google.com/+Kantanmt_cloudmachinetranslation><https://twitter.com/KantanMT> <https://twitter.com/KantanMT> <https://www.linkedin.com/company/kantanmt> <https://www.linkedin.com/company/kantanmt><http://www.slideshare.net/kantanmt> <http://www.slideshare.net/kantanmt> <https://www.youtube.com/user/KantanMT> <https://www.youtube.com/user/KantanMT><http://kantanmtblog.com/> <http://kantanmtblog.com/> <https://kantanmt.com/rssfeeds.php> <https://kantanmt.com/rssfeeds.php><info@kantanmt.com> <info@kantanmt.com>
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20160923/bd1495de/attachment.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 119, Issue 32
**********************************************

0 Response to "Moses-support Digest, Vol 119, Issue 32"

Post a Comment