Send Moses-support mailing list submissions to
moses-support@mit.edu
To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu
You can reach the person managing the list at
moses-support-owner@mit.edu
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."
Today's Topics:
1. Reusing alignments for 10**9 fr-en and using fast_align
(rohit dholakia)
2. Re: Reusing alignments for 10**9 fr-en and using fast_align
(Philipp Koehn)
3. Deregistration from mailing list (Pratik Jain)
4. Re: Deregistration from mailing list (Hieu Hoang)
5. Output unknown words in moses server (Roee Aharoni)
6. How to install Boost for MOSES (Asad A.Malik)
----------------------------------------------------------------------
Message: 1
Date: Mon, 6 Jan 2014 13:08:00 -0800
From: rohit dholakia <rdholaki@sfu.ca>
Subject: [Moses-support] Reusing alignments for 10**9 fr-en and using
fast_align
To: moses-support <moses-support@MIT.EDU>
Message-ID:
<CAA==LgvVY89jWvUmuQBRED8e186u3cO12aK09+XfKyj2t2GGqw@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Hi,
I have been trying to get a fr-en phrase table by using the 10**9 fr-en
Europarl corpora. Unfortunately, 47 hours of mgiza cluster time was not
enough to get the alignments. If I restart, will Moses reuse the alignments
it has ? I used --parallel and --parts 8, so Moses has something in both
directions.
Also, I have been trying to use fast_align, but it failed with a bad_alloc,
presumably because it ran out of memory. That aside, can I use fast_align in
the following manner:
0. Run moses --last-step 1
1. use fast_align with -d -o -v
2. Resume Moses as --first-step 3 --last-step 6
Thanks !
------------------------------
Message: 2
Date: Tue, 7 Jan 2014 01:21:05 +0000
From: Philipp Koehn <pkoehn@inf.ed.ac.uk>
Subject: Re: [Moses-support] Reusing alignments for 10**9 fr-en and
using fast_align
To: rohit dholakia <rdholaki@sfu.ca>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CAAFADDC7rFe-W=YBm-zqB9bprb5OH4XTqGaUJmC4p71G_m74bA@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Hi,
using the billion word French-English corpus, mgiza runs for a week even
with 8 cores.
So, fast_align is a good alternative.
Make sure that you create output in a form that the subsequent processing
steps can deal with. The easiest way to do that is to use experiment.perl
(see the example config files); otherwise, run the following command to
create symmetrized word alignments:
/path/to/moses/scripts/ems/support/symmetrize-fast-align.perl \
  fast-align-output fast-align-inverse-output corpus.f corpus.e aligned \
  grow-diag-final-and /path/to/moses/bin/symal
-phi
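A rough sketch of how the two fast_align outputs named in the command above might be produced, assuming fast_align is on the PATH and that corpus.f and corpus.e are sentence-aligned, tokenized files with one sentence per line. The function names make_fast_align_input and run_fast_align are hypothetical helpers introduced only for illustration. The flags -i, -d, -o, -v and -r are fast_align options; treating the -r run as the "inverse" output expected by the symmetrization script is an assumption and should be checked against the EMS example configs.

```shell
#!/bin/sh
# Hypothetical helper: join two parallel files into the
# "source ||| target" one-pair-per-line format that fast_align reads.
#   $1 = source-side file, $2 = target-side file, $3 = output file
make_fast_align_input() {
    paste "$1" "$2" | awk -F'\t' '{ print $1 " ||| " $2 }' > "$3"
}

# Hypothetical wrapper: run fast_align in both directions.
#   $1 = joined input, $2 = forward output, $3 = inverse output
# -d, -o and -v are the options mentioned in the original question;
# -r asks fast_align to align in the reverse direction.
run_fast_align() {
    fast_align -i "$1" -d -o -v > "$2"
    fast_align -i "$1" -d -o -v -r > "$3"
}
```

With these defined, one possible invocation is `make_fast_align_input corpus.f corpus.e corpus.f-e` followed by `run_fast_align corpus.f-e fast-align-output fast-align-inverse-output`, where corpus.f-e is an arbitrary intermediate file name. The two output files are then the first two arguments to symmetrize-fast-align.perl.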
On Mon, Jan 6, 2014 at 9:08 PM, rohit dholakia <rdholaki@sfu.ca> wrote:
> Hi,
>
> I have been trying to get a fr-en phrase table by using the 10**9 fr-en
> Europarl corpora. Unfortunately, 47 hours of mgiza cluster time was not
> enough to get the alignments. If I restart, will Moses reuse the alignments
> it has ? I used --parallel and --parts 8, so Moses has something in both
> directions.
>
> Also, I have been trying to use fast_align but it got a bad_alloc, must
> be less memory. Having said that, can I use fast_align in the following
> manner :
>
> 0. Run moses --last-step 1
>
> 1. use fast_align with -d -o -v
>
> 2. Resume Moses as --first-step 3 --last-step 6
>
> Thanks !
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
------------------------------
Message: 3
Date: Tue, 7 Jan 2014 12:12:26 +0530
From: Pratik Jain <tunetopj@gmail.com>
Subject: [Moses-support] Deregistration from mailing list
To: moses-support <Moses-support@mit.edu>
Message-ID:
<CACXoyhDPm8wpUshB1WZiU7GR83=MLMRs3WwQ-4K3obvteSZ_dQ@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Please sign me out of the mailing list.
Regards
Pratik Jain
------------------------------
Message: 4
Date: Tue, 7 Jan 2014 07:59:23 +0000
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] Deregistration from mailing list
To: Pratik Jain <tunetopj@gmail.com>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CAEKMkbh+hc4Hxvh=0jG_61=W9BM1_fA-v4YBNpAwsr2jEs29Lg@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Gladly. However, since you registered yourself, you should know how to
deregister yourself. The URL is at the bottom of every email you receive:
http://mailman.mit.edu/mailman/listinfo/moses-support
If you still have problems, you should contact the admin of the mailing
list (Me or Philipp Koehn). Please do not spam everyone on the mailing list.
On 7 January 2014 06:42, Pratik Jain <tunetopj@gmail.com> wrote:
> Please sign me out of the mailing list.
>
> Regards
> Pratik Jain
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
------------------------------
Message: 5
Date: Tue, 07 Jan 2014 08:08:11 -0800 (PST)
From: "Roee Aharoni" <roee.aharoni@gmail.com>
Subject: [Moses-support] Output unknown words in moses server
To: moses-support@mit.edu
Message-ID: <1389110890532.39c0ee5a@Nodemailer>
Content-Type: text/plain; charset="utf-8"
Hello, I use the option -output-unknown in moses decoder, which outputs the unknown (untranslated) words to a file. Is there an equivalent output from the mosesserver process?
Thanks!
Roee
------------------------------
Message: 6
Date: Tue, 7 Jan 2014 08:31:43 -0800 (PST)
From: "Asad A.Malik" <asad_12204@yahoo.com>
Subject: [Moses-support] How to install Boost for MOSES
To: "moses-support@mit.edu" <moses-support@MIT.EDU>
Message-ID:
<1389112303.58787.YahooMailNeo@web122201.mail.ne1.yahoo.com>
Content-Type: text/plain; charset="iso-8859-1"
Hi All,
I am trying to reinstall MOSES, and for that I have to install Boost. Is the command
sudo apt-get install libboost-all-dev
enough, or do I have to install it manually?
Regards
Asad A.Malik
------------------------------
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
End of Moses-support Digest, Vol 87, Issue 14
*********************************************