svm-subset(1) User Manuals svm-subset(1)NAME
svm-subset - a subset selection tool for LIBSVM
SYNOPSIS
svm-subset [ -s method ] dataset number [ output1 ] [ output2 ]
DESCRIPTION
Training large data is time consuming. Sometimes one should work on a smaller subset first. The python script subset.py randomly selects a
specified number of samples. For classification data, we provide a stratified selection to ensure the same class distribution in the sub-
set.
OPTIONS -s method
0 -- stratified selection (classification only) (default)
1 -- random selection
output1
The subset. If output1 is omitted, the subset will be printed on the screen.
output2
The rest of data.
FILES
See svm-train(1) for the format of dataset
EXAMPLES
svm-subset heart_scale 100 file1 file2
From heart_scale 100 samples are randomly selected and stored in file1. All remaining instances are stored in file2.
BUGS
Please report bugs to the Debian BTS.
AUTHOR
Chih-Chung Chang, Chih-Jen Lin <cjlin@csie.ntu.edu.tw>, Chen-Tse Tsai <ctse.tsai@gmail.com> (packaging)
SEE ALSO svm-train(1), svm-predict(1)Linux DEC 2009 svm-subset(1)
Check Out this Related Man Page
LIBLINEAR-PREDICT(1) General Commands Manual LIBLINEAR-PREDICT(1)NAME
liblinear-predict - Make predictions based on a trained linear classifier model
SYNOPSIS
linear-predict [options] test_file model_file output_file
DESCRIPTION
liblinear-predict uses the linear classifier model-file to make predictions for each of the samples in test_file and stores the results in
output_file.
OPTIONS
A summary of options is included below.
-b (0|1)
Whether to output probability estimates or not (default: 0)
EXAMPLES
Train a linear SVM using L2-loss function with linear-train(1):
liblinear-train data_file
Output probability estimates (for logistic regression only):
liblinear-predict -b 1 test_file data_file.model output_file
SEE ALSO liblinear-train(1), svm-predict(1), svm-train(1)AUTHORS
liblinear-predict was written by the LIBLINEAR authors at National Taiwan university for the LIBLINEAR Project.
This manual page was written by Christian Kastner <debian@kvr.at>, for the Debian project (and may be used by others).
March 08, 2011 LIBLINEAR-PREDICT(1)
hi,
I need a command which picks the records randomly from the file.
For example. i am having some 10000 entries in a file and need to extract the lines randomly without repeating the numbers.
Do anybody have any idea on how to get this out. (4 Replies)
Hi,
I need to check if a particular name is already in the file or not and i am using following code for this...
match=$(grep -n -e "$output1" outputfiles.txt )
where output1 is the variable name having names in it and outputfiles.txt is the file name ..and i am using ksh
can anybosy... (6 Replies)
Hi, All
I have a huge file which has 450G. Its tab-delimited format is as below
x1 A 50020 1
x1 B 50021 8
x1 C 50022 9
x1 A 50023 10
x2 D 50024 5
x2 C 50025 7
x2 F 50026 8
x2 N 50027 1
:
:
Now, I want to extract a subset from this file. In this subset, column 1 is x10, column 2 is... (3 Replies)
hi guys,
i have a data with a column of p value (normal format and scientific combined). i want to creat a subset of data which only contains p-value:
data 1: p<10^7
data 2: p<0.01
how should i do it? many thanks!
data looks like:
rs7841347 128887490 1.695e-007
rs1241347 ... (4 Replies)
Hello,
I recently patched my Solaris 10 box and found out that few of the apps are not working. Fortunately, I had detached the mirroring prior to patching, so I just booted into my secondary disk and found that my apps are working....
The problem is this was way back in last month....see... (14 Replies)
I am running RHEL 6 on VirualBox/VMWare VM and on that VM i am trying to create a KVM virtual machine.
Issue is that command "egrep 'vmx|svm' /proc/cpuinfo doesn't show my any results, so that lets me think if my processor doesn't support Virtualization technology. But it does and i have... (5 Replies)
Hello
Could you please help me to find a code that can randomly select 1224 lines from a file of 12240 and make tn output with 1224 line each.
my input is txt file with 12240 lines like :
13474 999003507 0 0 2 -9
13475 999003508 0 0 2 -9
13476 999003509 0 0 1 -9
13477 999003510 0 0 1 -9
... (7 Replies)
Dear all
I have a dataset (in text format,delimited by tab) which have 100 variables (say, var0-var99) and more than 100,000 observations. I want to do the following:
1. for variable var0-var49, I want to add "00" in front of each data (for example, "1" would become "001")
2. for variable... (8 Replies)
Each line of the file has some words exactly same letters as of the first one. But has zero or more "_+" inserted. I am interested in those words and remove the other cases.
Example:
abcde abcd_+e abcd_+de
fghig fghigi fghi_+g
klmn klmn
I want to get this:
abcde abcd_+e
fghig fghi_+g ... (7 Replies)
Hello Unix experts,
I need a help to create a subset file. I know with cut comand, its very easy to select many different columns, or threshold. But here I have a bit problem as in my data file is big. And I don't want to identify the column numbers or names manually. I am trying to find any... (7 Replies)
Hello,
I was working with Machine learning and would like to apply my regression algorithms on binary classification datasets.
So I came across this adult dataset, LIBSVM Data: Classification (Binary Class)
It is a binary dataset, features have values only 1 and 0.
and I wanted to... (4 Replies)
i am trying to prepare a train and test dataset, for which i need to randomly split the data into corresponding folders (train,test)..
I began on a simple script, but seem to get som weird error messages, that i cannot make sense of?..
what am I doing wrong?
#!/bin/bash
RED='\033]
then... (13 Replies)