Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

apertium-gen-stopwords-lextor(1) [debian man page]

apertium-gen-stopwords-lextor(1)										  apertium-gen-stopwords-lextor(1)

NAME
apertium-gen-stopwords-lextor - This application is part of ( apertium ) This tool is part of the apertium machine translation architecture: http://apertium.org. SYNOPSIS
apertium-gen-stopwords-lextor n input_file output_file DESCRIPTION
apertium-gen-stopwords-lextor is the application responsible for generating the list of stopwords used by the lexical selection module (apertium-lextor). Stopwords are ignored as they cannot have multiple translations. OPTIONS
n the desired number of stopwords. FILES
These are the kinds of parameters and files used with this tool: input_file contains a large preprocessed corpus (see apertium-preprocess-corpus-lextor). output_file The file which gets the generated stopwords. SEE ALSO
apertium-gen-lextorbil(1), apertium-gen-lextormono(1), apertium-preprocess-corpus-lextor(1), apertium-gen-wlist-lextor(1), aper- tium-gen-wlist-lextor-translation(1), apertium-lextor-eval(1), apertium-lextor(1). BUGS
Lots of...lurking in the dark and waiting for you! AUTHOR
(c) 2005,2006 Universitat d'Alacant / Universidad de Alicante. All rights reserved. 2006-12-12 apertium-gen-stopwords-lextor(1)

Check Out this Related Man Page

apertium-preprocess-corpus-lextor(1)									      apertium-preprocess-corpus-lextor(1)

NAME
apertium-preprocess-corpus-lextor - This application is part of ( apertium ) This tool is part of the apertium machine translation architecture: http://apertium.org. SYNOPSIS
apertium-preprocess-corpus-lextor data_dir translation_dir input_file output_file DESCRIPTION
apertium-preprocess-corpus-lextor is the application responsible for preprocessing the training corpus for the lexical selector training. OPTIONS
This tool currently has no options. FILES
These are the kinds of files and directories used with this tool: data_dir the path to the linguistic data to use. translation_dir the translation direction to use. input_file contains a large corpus in raw format. output_file The file which gets the preprocessed corpus. SEE ALSO
apertium-gen-lextorbil(1), apertium-gen-lextormono(1), apertium-gen-lextor-eval(1), apertium-gen-stopwords-lextor(1), aper- tium-gen-wlist-lextor(1), apertium-gen-wlist-lextor-translation(1), apertium-lextor(1). BUGS
Lots of...lurking in the dark and waiting for you! AUTHOR
(c) 2005,2006 Universitat d'Alacant / Universidad de Alicante. All rights reserved. 2006-12-12 apertium-preprocess-corpus-lextor(1)
Man Page

7 More Discussions You Might Find Interesting

1. Programming

Client and Server program gen by Makefile

I created a "ebanking.x" file and run it as " rpcgen -a ebaning.x" It gen a few of files to me which is - "ebanking.h", "ebanking_server.c", "ebanking_svc.c", "ebanking_client.c", "ebanking_clnt.c", "ebanking_xdr.c" and "Makefile" The content of "ebanking.x" : struct bankargs { ... (0 Replies)
Discussion started by: wongalan48
0 Replies

2. Shell Programming and Scripting

find a string and copy the string after that

Hi! just want to seek help on this: i have a file wherein i want to find a string and copy the string after that and paste that other string to a new file. ex: TOTAL 123456 find "TOTAL" and copy "123456" and paste "123456" to a new file NOTE: there are many "TOTAL" strings on that... (12 Replies)
Discussion started by: kingpeejay
12 Replies

3. Emergency UNIX and Linux Support

Incorrect Permissions on a file

Hi all, I am facing a problem currently, with incorrect permissions being set on a file that is being generated automatically. There is a cronjob which runs for an id uvstart, at 12:30 am ET which generates this file on a particular directory. When the file is generated, it has the permissions... (5 Replies)
Discussion started by: ggayathri
5 Replies

4. Shell Programming and Scripting

Linguistic project: extract co-occurrences from text corpus

Hello guys, I've got a big corpus (a huge text file in which words are separated by one or several spaces). I would like to know if there is a simple way - using awk for instance - to extract any co-occurrence appearing at least 3times through the whole corpus for a given word. By co-occurrence,... (7 Replies)
Discussion started by: bobylapointe
7 Replies

5. What is on Your Mind?

Do You Own a Kindle?

The Kindle 3 is making headlines as Amazon's #1 bestseller. According to Amazon: Do you own a Kindle? What do you think about them? (43 Replies)
Discussion started by: Neo
43 Replies

6. UNIX for Dummies Questions & Answers

Replacing stopwords based on a list

Dear all, I have Files with lines of text in them, I want to replace the stopwords in them with ",". I have create a file which contain the stopwords... I have been trying for last 3 hours but no success I have managed to replace one using "sed" and delete the line containing them using... (3 Replies)
Discussion started by: A-V
3 Replies

7. Shell Programming and Scripting

Grepping verbal forms from a large corpus

I want to extract verbal forms from a large corpus of English. I have identified a certain number of patterns. Each pattern has the following structure SPACE word_CATEGORY where word refers to the verbal form and CATEGORY refers to the class of the verb The categories are identified as per the... (4 Replies)
Discussion started by: gimley
4 Replies