Moses-support Digest, Vol 92, Issue 37

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Creating a 2-gram language model (Rajkiran Rajkumar)
2. Help about Using Giza++ for comparable Corpora
(alireza tabebordbar)


----------------------------------------------------------------------

Message: 1
Date: Thu, 19 Jun 2014 11:49:21 +0530
From: Rajkiran Rajkumar <rajkiran2507@gmail.com>
Subject: [Moses-support] Creating a 2-gram language model
To: moses-support@mit.edu
Message-ID:
<CAHMStsJNs6pFSFuS4EkfAYMw52PKxSrampuxOK9V13v9yVVUzg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

In the baseline translation system, the command -
"~/irstlm/bin/build-lm.sh \
-i news-commentary-v8.fr-en.sb.en \
-t ./tmp -p -s improved-kneser-ney -o news-commentary-v8.fr-en.lm.en"

is used to create a "3-gram language model, removing singletons, smoothing
with improved Kneser-Ney, and adding sentence boundary symbols" is what the
tutorial says. How can I build a 2-gram language model?

And, in general, which is more efficient for a bilingual corpus of 160,000
sentences? 2-gram or 3-gram?

Thanks in advance,
Rajkiran
College of Engineering Guindy, India
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20140619/39479919/attachment-0001.htm

------------------------------

Message: 2
Date: Thu, 19 Jun 2014 15:50:59 +0430
From: alireza tabebordbar <ar.tabebordbar@gmail.com>
Subject: [Moses-support] Help about Using Giza++ for comparable
Corpora
To: Moses-support@mit.edu
Message-ID:
<CAAECi_FBOgVWcOOasTt8ap40hLrKqRyPUsDp_CeZ83+L5_OXjw@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1

Hi all
I am Master degree Student and I'm New to SMT.
I extracted some Comparable sentences and I want to use Giza++ for
word alignment.I Think Giza Use EM algorithm for aligning words,
However I don't know that I can feed comparable sentences directly to
Giza++ or I have combine them with some Parallel sentences.


------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 92, Issue 37
*********************************************

0 Response to "Moses-support Digest, Vol 92, Issue 37"

Post a Comment