Moses-support Digest, Vol 86, Issue 2

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Re: Regarding XML (Philipp Koehn)
2. regarding previous xml question (Kalyani Baruah)


----------------------------------------------------------------------

Message: 1
Date: Sun, 1 Dec 2013 15:00:30 +0000
From: Philipp Koehn <pkoehn@inf.ed.ac.uk>
Subject: Re: [Moses-support] Regarding XML
To: Kalyani Baruah <kajubaruah04@gmail.com>
Cc: "moses-support@mit.edu" <moses-support@mit.edu>
Message-ID:
<CAAFADDDZt-gRNv0nD3NAe7scVP8cDxJGQux=8qtd5ZJ=RL5Z4Q@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1

Hi,

it is not entirely clear to me what you are asking here.

Moses uses as data format for the parallel corpus and the input
just plain one-sentence-per-line text. So you would have to
convert your XML files into this format.

If there are XML tags in the plain text such as <b>bold tags</b>
then there are number of ways to deal with that. The easiest is
to just strip them out, remember their position, and re-insert
them based on the word alignment.

-phi

On Sun, Dec 1, 2013 at 2:37 AM, Kalyani Baruah <kajubaruah04@gmail.com> wrote:
> Good day...
> My qustinon regarding xml ..as i have said that i was using a text file as
> an input to moses. bt i was to give an xml input. As in text file i am
> having my parallel corpus ..not a file with probability ratio mentioned..
> just a parallel corpus collection in two languages.so how to convert them as
> a xml file. so that i can take those two xml files as input for moses.
>
>
>
>
>
>
> Regards,
>
>
> Kalyanee Kanchan Baruah
> Department of Information Technology,
> Institute of Science and Technology,
> Gauhati University,Guwahati,India
> Phone- +91-9706242124
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>


------------------------------

Message: 2
Date: Sun, 1 Dec 2013 21:29:00 +0530
From: Kalyani Baruah <kajubaruah04@gmail.com>
Subject: [Moses-support] regarding previous xml question
To: moses-support@mit.edu
Message-ID:
<CAJZ5LDcEKcVth9MzEm-Bh3BU-2-+bKUpYkzcwkhvtOnSHkh3vQ@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

hello sir..

m sorry that i couldn't explain regarding xml.I am working on translation
and i have just completed translating using moses manual.but i am giving an
text file (parallel corpus) as an input.My translation was not 100per
accurate.As my corpus are in differend language..English- Bengali.I had to
transliterate the proper noun.After doin that i want to add my
transliterated output to the translate..so that i get the proper
output.After reading many papers i came to know that i have to give t the
transliterated output as input to translation module in an xml file
formate..(in order to join both translation and translation )to get the
final translated ouput.Hope m clear now i am in middle of my work now..as i
have a translated output which have words which are not transliterated and
with another output with transliterated(proper nouns). d
ont know how to join them..and dont knw how to give an xml file as input.
Shall i write all the sentences in the corrpus inside xml tags..and do
angain toknize,truecase etc..or are both same ie giving xml file or text
file doesnt matters..

Regards,


*Kalyanee Kanchan Baruah*
Department of Information Technology,
Institute of Science and Technology,
Gauhati University,Guwahati,India
Phone- +91-9706242124
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131201/11cb8211/attachment-0001.htm

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 86, Issue 2
********************************************

0 Response to "Moses-support Digest, Vol 86, Issue 2"

Post a Comment