Moses-support Digest, Vol 126, Issue 35

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."

Today's Topics:

1. Re: Looking for a tool for training csv delimited and aligned
data (Allen Smith)
2. Re: Looking for a tool for training csv delimited and aligned
data (doc)
3. Re: Looking for a tool for training csv delimited and aligned
data (Vincent Nguyen)

----------------------------------------------------------------------

Message: 1
Date: Tue, 25 Apr 2017 16:53:59 -0500
From: Allen Smith <allen.w.smith1@gmail.com>
Subject: Re: [Moses-support] Looking for a tool for training csv
delimited and aligned data
To: doc <raymond.doctor@gmail.com>
Cc: moses-support@mit.edu
Message-ID:
<CALiVeREVzJZOz7P65S9dvg1LmspcaYWgjF_GaPwOAnsueYPNJw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Are you wanting to train something to align letters and sounds, or to
figure out the sounds given the letters? (As it happens, I've been working
on using GIZApp to do the former.)

-Allen (Allen W. Smith, Ph.D.)

On Tue, Apr 25, 2017 at 4:01 AM, doc <raymond.doctor@gmail.com> wrote:

> Hello,
> I am looking for a tool for training data using either Statistical methods
> or even CNN/RNN.
> Basically the tool would allow the user to train simple data and then once
> trained, it could be deployed to predict unknown data
> As an example, I have around 80,000 words in English converted to IPA
> [text aligned in CSV format] and would like to train the tool using the
> data, to predict in the case of new words.
> Using Moses is like using a surgeon's scalpel to saw wood
> And since I work in a Windows environment installing Moses is not very easy
> In any case a large number of linguists like me would prefer to have a
> tool with ease of use.I am sure that if such a tool is made available, it
> will be one of the most popular tools.
>
> Thanks in advance for any help.
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170425/3dc3bc37/attachment-0001.html

------------------------------

Message: 2
Date: Wed, 26 Apr 2017 08:27:51 +0530
From: doc <raymond.doctor@gmail.com>
Subject: Re: [Moses-support] Looking for a tool for training csv
delimited and aligned data
To: Allen Smith <allen.w.smith1@gmail.com>
Cc: moses-support@mit.edu
Message-ID:
<CAJxcEy9p56dS=-cce7EVrZadiW-8TbrddM6my9x_00=044M5oQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Many thanks for your kind interest.
Basically I aim was to find if a training tool exists which can train data
with the following format:
abc=def
Where the right hand is the input and the left hand is the output.
I chose English to UK IPA as an example. I have around 80,000+ strings of
English to IPA. An example is given below:
?em=?m
?neath=ni??
?shun=??n
?twas=tw?z
?twen=twi?n
?twen-decks=?twi?n-deks
?twere=tw??r
?twill=tw?l
?twixt=tw?kst
?twould=tw?d
?un=?n
A=ei?
A?s=e?z
A-bomb=?e?-b?m
If I had an unknown word like say
superpose
the tool after training should be able to predict
su?p??p??z
or something like that. Stress does not matter
I wonder if someone in the MOSES team could come up with a small tool which
can be used for such kind of work. As a linguist, I can assure you that
such a tool would be immensely popular and render great service to the
community.
I need not add that I will download GIZApp and try. I am trying out SciKit,
but the results are not very encouraging.
Many thanks

On Wed, Apr 26, 2017 at 3:23 AM, Allen Smith <allen.w.smith1@gmail.com>
wrote:

> Are you wanting to train something to align letters and sounds, or to
> figure out the sounds given the letters? (As it happens, I've been working
> on using GIZApp to do the former.)
>
> -Allen (Allen W. Smith, Ph.D.)
>
> On Tue, Apr 25, 2017 at 4:01 AM, doc <raymond.doctor@gmail.com> wrote:
>
>> Hello,
>> I am looking for a tool for training data using either Statistical
>> methods or even CNN/RNN.
>> Basically the tool would allow the user to train simple data and then
>> once trained, it could be deployed to predict unknown data
>> As an example, I have around 80,000 words in English converted to IPA
>> [text aligned in CSV format] and would like to train the tool using the
>> data, to predict in the case of new words.
>> Using Moses is like using a surgeon's scalpel to saw wood
>> And since I work in a Windows environment installing Moses is not very
>> easy
>> In any case a large number of linguists like me would prefer to have a
>> tool with ease of use.I am sure that if such a tool is made available, it
>> will be one of the most popular tools.
>>
>> Thanks in advance for any help.
>>
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> Moses-support@mit.edu
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170425/9ae54048/attachment-0001.html

------------------------------

Message: 3
Date: Wed, 26 Apr 2017 10:16:01 +0200
From: Vincent Nguyen <vnguyen@neuf.fr>
Subject: Re: [Moses-support] Looking for a tool for training csv
delimited and aligned data
To: moses-support@mit.edu
Message-ID: <954ce8b9-9cda-d5fb-078b-96b534d6090c@neuf.fr>
Content-Type: text/plain; charset="utf-8"

I think you mixed up input/ouput because in your example at the end, you
would like to get pronunciation of a given new word.
input is the left hand side and output is the pron.

If you are able to rework a little bit the right hand side of your data
(you need to stretch the phones one by one, based on unique ones)

then the tool you are looking for is this one:
https://github.com/sequitur-g2p/sequitur-g2p

Le 26/04/2017 ? 04:57, doc a ?crit :
> Many thanks for your kind interest.
> Basically I aim was to find if a training tool exists which can train
> data with the following format:
> abc=def
> Where the right hand is the input and the left hand is the output.
> I chose English to UK IPA as an example. I have around 80,000+ strings
> of English to IPA. An example is given below:
> ?em=?m
> ?neath=ni??
> ?shun=??n
> ?twas=tw?z
> ?twen=twi?n
> ?twen-decks=?twi?n-deks
> ?twere=tw??r
> ?twill=tw?l
> ?twixt=tw?kst
> ?twould=tw?d
> ?un=?n
> A=ei?
> A?s=e?z
> A-bomb=?e?-b?m
> If I had an unknown word like say
> superpose
> the tool after training should be able to predict
> su?p??p??z
> or something like that. Stress does not matter
> I wonder if someone in the MOSES team could come up with a small tool
> which can be used for such kind of work. As a linguist, I can assure
> you that such a tool would be immensely popular and render great
> service to the community.
> I need not add that I will download GIZApp and try. I am trying out
> SciKit, but the results are not very encouraging.
> Many thanks
>
>
>
> On Wed, Apr 26, 2017 at 3:23 AM, Allen Smith <allen.w.smith1@gmail.com
> <mailto:allen.w.smith1@gmail.com>> wrote:
>
> Are you wanting to train something to align letters and sounds, or
> to figure out the sounds given the letters? (As it happens, I've
> been working on using GIZApp to do the former.)
>
> -Allen (Allen W. Smith, Ph.D.)
>
> On Tue, Apr 25, 2017 at 4:01 AM, doc <raymond.doctor@gmail.com
> <mailto:raymond.doctor@gmail.com>> wrote:
>
> Hello,
> I am looking for a tool for training data using either
> Statistical methods or even CNN/RNN.
> Basically the tool would allow the user to train simple data
> and then once trained, it could be deployed to predict unknown
> data
> As an example, I have around 80,000 words in English converted
> to IPA [text aligned in CSV format] and would like to train
> the tool using the data, to predict in the case of new words.
> Using Moses is like using a surgeon's scalpel to saw wood
> And since I work in a Windows environment installing Moses is
> not very easy
> In any case a large number of linguists like me would prefer
> to have a tool with ease of use.I am sure that if such a tool
> is made available, it will be one of the most popular tools.
>
> Thanks in advance for any help.
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu <mailto:Moses-support@mit.edu>
> http://mailman.mit.edu/mailman/listinfo/moses-support
> <http://mailman.mit.edu/mailman/listinfo/moses-support>
>
>
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170426/c0c54d24/attachment.html

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

End of Moses-support Digest, Vol 126, Issue 35
**********************************************

Moses-support Digest, Vol 126, Issue 35

0 Response to "Moses-support Digest, Vol 126, Issue 35"

Post a Comment