svm-subset(1)							   User Manuals 						     svm-subset(1)

NAME
       svm-subset - a subset selection tool for LIBSVM

SYNOPSIS
       svm-subset [ -s method ] dataset number [ output1 ] [ output2 ]

DESCRIPTION
       Training on large data sets is time consuming, so it is often useful to work on a smaller subset first. The Python script subset.py randomly selects a specified number of samples. For classification data, it provides a stratified selection to ensure the same class distribution in the subset.

OPTIONS
       -s method
              0 -- stratified selection (classification only) (default)
              1 -- random selection

       output1
              The subset. If output1 is omitted, the subset is printed to the screen.

       output2
              The rest of the data.

FILES
       See svm-train(1) for the format of dataset.

EXAMPLES
       svm-subset heart_scale 100 file1 file2

       From heart_scale, 100 samples are randomly selected and stored in file1. All remaining instances are stored in file2.

BUGS
       Please report bugs to the Debian BTS.

AUTHOR
       Chih-Chung Chang, Chih-Jen Lin <cjlin@csie.ntu.edu.tw>, Chen-Tse Tsai <ctse.tsai@gmail.com> (packaging)

SEE ALSO
       svm-train(1), svm-predict(1)

Linux                                DEC 2009                                svm-subset(1)
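
NOTES
       The stratified selection described under DESCRIPTION can be illustrated with a minimal sketch. This is not the code of subset.py; it is one possible way to keep class proportions, assuming each input line is in LIBSVM format with the class label as the first whitespace-separated token. The function name stratified_subset and its seed parameter are hypothetical, chosen only for this example.

```python
import random
from collections import defaultdict


def stratified_subset(lines, number, seed=None):
    """Split lines into (subset, rest), roughly preserving class proportions.

    Assumption: each line is in LIBSVM format, so the class label is the
    first whitespace-separated token. This is an illustrative sketch, not
    the implementation used by subset.py.
    """
    total = len(lines)
    if not 0 <= number <= total:
        raise ValueError("number must be between 0 and the dataset size")
    rng = random.Random(seed)

    # Group line indices by class label.
    by_label = defaultdict(list)
    for i, line in enumerate(lines):
        by_label[line.split(None, 1)[0]].append(i)

    # Proportional quota per class using integer arithmetic: take the
    # floor first, then give the leftover slots to the classes with the
    # largest remainders so the quotas sum to exactly `number`.
    quota = {}
    remainders = []
    for label, idx in by_label.items():
        q, r = divmod(number * len(idx), total)
        quota[label] = q
        remainders.append((r, label))
    leftover = number - sum(quota.values())
    for _, label in sorted(remainders, reverse=True)[:leftover]:
        quota[label] += 1

    # Draw the quota from each class without replacement.
    chosen = set()
    for label, idx in by_label.items():
        chosen.update(rng.sample(idx, quota[label]))

    subset = [lines[i] for i in sorted(chosen)]
    rest = [lines[i] for i in range(total) if i not in chosen]
    return subset, rest
```

       With a data set of 60 lines labelled +1 and 40 lines labelled -1, requesting 10 samples yields 6 of the first class and 4 of the second, and the remaining 90 lines form the rest, which corresponds to the roles of output1 and output2.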

LIBLINEAR-PREDICT(1)					      General Commands Manual					      LIBLINEAR-PREDICT(1)

NAME
       liblinear-predict - make predictions based on a trained linear classifier model

SYNOPSIS
       liblinear-predict [options] test_file model_file output_file

DESCRIPTION
       liblinear-predict uses the linear classifier model_file to make predictions for each of the samples in test_file and stores the results in output_file.

OPTIONS
       A summary of options is included below.

       -b (0|1)
              Whether to output probability estimates or not (default: 0)

EXAMPLES
       Train a linear SVM using the L2-loss function with liblinear-train(1):

              liblinear-train data_file

       Output probability estimates (for logistic regression only):

              liblinear-predict -b 1 test_file data_file.model output_file

SEE ALSO
       liblinear-train(1), svm-predict(1), svm-train(1)

AUTHORS
       liblinear-predict was written by the LIBLINEAR authors at National Taiwan University for the LIBLINEAR Project.

       This manual page was written by Christian Kastner <debian@kvr.at>, for the Debian project (and may be used by others).

March 08, 2011                                                        LIBLINEAR-PREDICT(1)