Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Re: Regarding Transliteration using Moses (Pratik Jain)
2. Regarding XML (Kalyani Baruah)
3. Re: Regarding Transliteration using Moses (Hassan Sajjad)
----------------------------------------------------------------------
Message: 1
Date: Sun, 1 Dec 2013 05:43:27 +0530
From: Pratik Jain <tunetopj@gmail.com>
Subject: Re: [Moses-support] Regarding Transliteration using Moses
To: Pranjal Das <pranjal4456@gmail.com>
Cc: moses-support <moses-support@mit.edu>, Hieu Hoang
<hieu.hoang@ed.ac.uk>, Prashant Mathur <pm.prashant@gmail.com>
Message-ID:
<CACXoyhAUQb6+R6bT1oGC-_jYwDUa=C--B3K7Vo3DrkL_LbCV2g@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Pranjal. as I told you in my previous reply, you can't get the whole word
transliterated. You have to break it, transliterate and join it again.
On Sat, Nov 30, 2013 at 12:24 PM, Pranjal Das <pranjal4456@gmail.com> wrote:
> Thank you prashant and pratik...i have done the splitting character-wise
> and then the training part...as you said...but i am unable to do the
> reassemble part. For eg The letter ' ? ' is translietrated to bh, ' ? '
> is transliterated to r..but the complete word
> ???? is not transliterated. it gives the same output ????. please can you
> guide me...how i can get the complete word.
> echo '????'| ~/mosesdecoder/bin/moses -f ~/work1/train/model/moses.ini
> gives the output ????
>
> *Pranjal Das*
> Department of Information Technology,
> Institute of Science and Technology,
> Gauhati University,Guwahati,Assam
> Phone- +91-8399879454
>
>
> On Fri, Nov 29, 2013 at 1:02 PM, Prashant Mathur <pm.prashant@gmail.com>wrote:
>
>> hi pranjal,
>>
>> when you trained the system there is no rule for translating ???? but
>> there is a translation rule for translating ?? and ??..
>> having said that if you want a good transliteration system you should
>> separate the vowels from consonants.. like if you want to split ???? try
>> spring it like
>> ? + ? + ? + ? -> bh + a + ra + ta
>> it can be even more fine grained
>>
>> and when you want to transliterate a word you would have to split the
>> test set like the training data.
>>
>> this is how i would do it.
>> i don't know how pratik did.
>>
>> just being curious.. have you tried translating from english to wx
>> notation in hindi
>>
>> Prashant
>> On Nov 29, 2013 10:33 AM, "Pranjal Das" <pranjal4456@gmail.com> wrote:
>>
>>> hii ,
>>> hav tried as you said pratik..but cant get the original word.as for
>>> example ???? as
>>> ?? ?? ->bha rat ,if i echo bha i get the word ?? but in its place if i
>>> write ???? not transliterated.plese help me out if you hav any idea
>>> about this..how to get that complete word..as those which are not
>>> translated i suppose must be transliterated.
>>> Thanks
>>>
>>>
>>> regards
>>>
>>> *Pranjal Das*
>>> Department of Information Technology,
>>> Institute of Science and Technology,
>>> Gauhati University,Guwahati,Assam
>>> Phone- +91-8399879454
>>>
>>>
>>> On Wed, Nov 27, 2013 at 9:44 PM, Pratik Jain <tunetopj@gmail.com> wrote:
>>>
>>>> Just get a parallel list of words in the 2 languages. Try to break each
>>>> word syllable-wise or simply character-wise. Thus, each line of the two
>>>> files would have single words with a space between each character (or each
>>>> syllable). Use these 2 files to train the system as normal translation task.
>>>> If your word list is large enough, this works quite nice. Try it out,
>>>> it worked good for me.
>>>>
>>>>
>>>> On Wed, Nov 27, 2013 at 10:49 PM, Hieu Hoang <Hieu.Hoang@ed.ac.uk>wrote:
>>>>
>>>>> There's no support for transliteration in Moses. If you create 1,
>>>>> please let us know and we can add it to Moses
>>>>>
>>>>>
>>>>> On 27 November 2013 07:44, Pranjal Das <pranjal4456@gmail.com> wrote:
>>>>>
>>>>>> hii,
>>>>>> i have completed my translation using moses toolkit,following the
>>>>>> manual for installing moses,GIZZA++,IRSTLM and have tried out the sample
>>>>>> models for translation and applying those on my own parallel
>>>>>> corpus(fr-en),now i am facing problems regarding transliteration as there
>>>>>> is no proper way given in the manual for Transliteration using moses.kindly
>>>>>> help me with transliteration steps.where i can find them.
>>>>>>
>>>>>>
>>>>>>
>>>>>> Thanku
>>>>>>
>>>>>> Regards,
>>>>>>
>>>>>> *Pranjal Das*
>>>>>> Department of Information Technology,
>>>>>> Gauhati University Institute of Science and Technology,Guwahati,India
>>>>>> Phone- +91-8399879454
>>>>>>
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Moses-support mailing list
>>>>>> Moses-support@mit.edu
>>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Hieu Hoang
>>>>> Research Associate
>>>>> University of Edinburgh
>>>>> http://www.hoang.co.uk/hieu
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Moses-support mailing list
>>>>> Moses-support@mit.edu
>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>
>>>>>
>>>>
>>>
>>> _______________________________________________
>>> Moses-support mailing list
>>> Moses-support@mit.edu
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>
>>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131201/326d4ded/attachment-0001.htm
------------------------------
Message: 2
Date: Sun, 1 Dec 2013 08:07:43 +0530
From: Kalyani Baruah <kajubaruah04@gmail.com>
Subject: [Moses-support] Regarding XML
To: moses-support@mit.edu
Message-ID:
<CAJZ5LDd-bVEOC2Pgm4-Dk8TW7cdStu4E52RSMgGRh9gDsGbkUQ@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Good day...
My qustinon regarding xml ..as i have said that i was using a text file as
an input to moses. bt i was to give an xml input. As in text file i am
having my parallel corpus ..not a file with probability ratio mentioned..
just a parallel corpus collection in two languages.so how to convert them
as a xml file. so that i can take those two xml files as input for moses.
Regards,
*Kalyanee Kanchan Baruah*
Department of Information Technology,
Institute of Science and Technology,
Gauhati University,Guwahati,India
Phone- +91-9706242124
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131201/25b3fc2e/attachment-0001.htm
------------------------------
Message: 3
Date: Sun, 1 Dec 2013 10:03:43 +0300
From: Hassan Sajjad <sajjad@ims.uni-stuttgart.de>
Subject: Re: [Moses-support] Regarding Transliteration using Moses
To: Pratik Jain <tunetopj@gmail.com>
Cc: moses-support <moses-support@mit.edu>, Hieu Hoang
<hieu.hoang@ed.ac.uk>, Pranjal Das <pranjal4456@gmail.com>, Prashant
Mathur <pm.prashant@gmail.com>
Message-ID:
<CAOiX71YtHH=wt2c0dscWrWVezjXkGqGQa7+FJVqcEVgwTROZBA@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Hi Pranjal,
If the command you mentioned is of your transliteration system then you are
very close to done. Rather than specifying like the following:
echo '????'| ~/mosesdecoder/bin/moses -f ~/work1/train/model/moses.ini
put space between the characters of the input word. So, moses considers it
as a complete sentence and outputs its translation (transliteration)
according to the model.
echo '? ? ? ?'| ~/mosesdecoder/bin/moses -f ~/work1/train/model/moses.ini
Now your output will look something like 'bh a r a t'. You can use a simple
perl script to remove spaces from the characters.
You can also generate an N-best list of transliteration by using -n-best-list
listfile 100. See
http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc8
Cheers,
Hassan
On Sun, Dec 1, 2013 at 3:13 AM, Pratik Jain <tunetopj@gmail.com> wrote:
> Pranjal. as I told you in my previous reply, you can't get the whole word
> transliterated. You have to break it, transliterate and join it again.
>
>
> On Sat, Nov 30, 2013 at 12:24 PM, Pranjal Das <pranjal4456@gmail.com>wrote:
>
>> Thank you prashant and pratik...i have done the splitting character-wise
>> and then the training part...as you said...but i am unable to do the
>> reassemble part. For eg The letter ' ? ' is translietrated to bh, ' ? '
>> is transliterated to r..but the complete word
>> ???? is not transliterated. it gives the same output ????. please can
>> you guide me...how i can get the complete word.
>> echo '????'| ~/mosesdecoder/bin/moses -f ~/work1/train/model/moses.ini
>> gives the output ????
>>
>> *Pranjal Das*
>> Department of Information Technology,
>> Institute of Science and Technology,
>> Gauhati University,Guwahati,Assam
>> Phone- +91-8399879454
>>
>>
>> On Fri, Nov 29, 2013 at 1:02 PM, Prashant Mathur <pm.prashant@gmail.com>wrote:
>>
>>> hi pranjal,
>>>
>>> when you trained the system there is no rule for translating ???? but
>>> there is a translation rule for translating ?? and ??..
>>> having said that if you want a good transliteration system you should
>>> separate the vowels from consonants.. like if you want to split ???? try
>>> spring it like
>>> ? + ? + ? + ? -> bh + a + ra + ta
>>> it can be even more fine grained
>>>
>>> and when you want to transliterate a word you would have to split the
>>> test set like the training data.
>>>
>>> this is how i would do it.
>>> i don't know how pratik did.
>>>
>>> just being curious.. have you tried translating from english to wx
>>> notation in hindi
>>>
>>> Prashant
>>> On Nov 29, 2013 10:33 AM, "Pranjal Das" <pranjal4456@gmail.com> wrote:
>>>
>>>> hii ,
>>>> hav tried as you said pratik..but cant get the original word.as for
>>>> example ???? as
>>>> ?? ?? ->bha rat ,if i echo bha i get the word ?? but in its place if i
>>>> write ???? not transliterated.plese help me out if you hav any idea
>>>> about this..how to get that complete word..as those which are not
>>>> translated i suppose must be transliterated.
>>>> Thanks
>>>>
>>>>
>>>> regards
>>>>
>>>> *Pranjal Das*
>>>> Department of Information Technology,
>>>> Institute of Science and Technology,
>>>> Gauhati University,Guwahati,Assam
>>>> Phone- +91-8399879454
>>>>
>>>>
>>>> On Wed, Nov 27, 2013 at 9:44 PM, Pratik Jain <tunetopj@gmail.com>wrote:
>>>>
>>>>> Just get a parallel list of words in the 2 languages. Try to break
>>>>> each word syllable-wise or simply character-wise. Thus, each line of the
>>>>> two files would have single words with a space between each character (or
>>>>> each syllable). Use these 2 files to train the system as normal translation
>>>>> task.
>>>>> If your word list is large enough, this works quite nice. Try it out,
>>>>> it worked good for me.
>>>>>
>>>>>
>>>>> On Wed, Nov 27, 2013 at 10:49 PM, Hieu Hoang <Hieu.Hoang@ed.ac.uk>wrote:
>>>>>
>>>>>> There's no support for transliteration in Moses. If you create 1,
>>>>>> please let us know and we can add it to Moses
>>>>>>
>>>>>>
>>>>>> On 27 November 2013 07:44, Pranjal Das <pranjal4456@gmail.com> wrote:
>>>>>>
>>>>>>> hii,
>>>>>>> i have completed my translation using moses toolkit,following the
>>>>>>> manual for installing moses,GIZZA++,IRSTLM and have tried out the sample
>>>>>>> models for translation and applying those on my own parallel
>>>>>>> corpus(fr-en),now i am facing problems regarding transliteration as there
>>>>>>> is no proper way given in the manual for Transliteration using moses.kindly
>>>>>>> help me with transliteration steps.where i can find them.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Thanku
>>>>>>>
>>>>>>> Regards,
>>>>>>>
>>>>>>> *Pranjal Das*
>>>>>>> Department of Information Technology,
>>>>>>> Gauhati University Institute of Science and Technology,Guwaha
>>>>>>> ti,India
>>>>>>> Phone- +91-8399879454
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Moses-support mailing list
>>>>>>> Moses-support@mit.edu
>>>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> Hieu Hoang
>>>>>> Research Associate
>>>>>> University of Edinburgh
>>>>>> http://www.hoang.co.uk/hieu
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> Moses-support mailing list
>>>>>> Moses-support@mit.edu
>>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>>
>>>>>>
>>>>>
>>>>
>>>> _______________________________________________
>>>> Moses-support mailing list
>>>> Moses-support@mit.edu
>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>
>>>>
>>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131201/b8f36c33/attachment.htm
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 86, Issue 1
********************************************
Subscribe to:
Post Comments (Atom)
0 Response to "Moses-support Digest, Vol 86, Issue 1"
Post a Comment