How to use prediction with classifier model

Soheyl Arab

Feb 11, 2016, 9:50:22 AM

to python-weka-wrapper

Hi,

I want to make predictions with a Weka classifier model or python-weka-wrapper.

How can I do the prediction?

This is what I have done so far with python-weka-wrapper:

>>> import weka.core.jvm as jvm
>>> jvm.start()
>>> from weka.core.converters import Loader
>>> loader = Loader(classname="weka.core.converters.ArffLoader")
>>> data = loader.load_file("/Users/Soheyl/Desktop/Test/train.arff")
>>> test = loader.load_file("/Users/Soheyl/Desktop/Test/test.arff")
>>> data.class_is_last()
>>> from weka.classifiers import Classifier
>>> classifier = Classifier(classname="weka.classifiers.trees.J48", options=["-C", "0.3"])
>>> classifier.build_classifier(data)
>>> import weka.core.serialization as serialization
>>> from weka.core.dataset import Instances
>>> serialization.write_all("/Users/Soheyl/Desktop/Test/out.model", [classifier, Instances.template_instances(data)])
>>> objects = serialization.read_all("/Users/Soheyl/Desktop/Test/out.model")
>>> classifier2 = Classifier(jobject=objects[0])
>>> data2 = Instances(jobject=objects[1])

What should I do next?

How can I use test.arff to make predictions with out.model?
Can I use the model that was created and saved with Weka by the serialization.write_all call above?

test.arff

train.arff

Peter Reutemann

Feb 11, 2016, 1:01:10 PM

to python-weka-wrapper

> I want to predict with a Weka classifier model or python-weka-wrapper.
> How can I do the prediction?

See my answers below.

> I do this with python-weka-wrapper:
>
> >>> import weka.core.jvm as jvm
> >>> jvm.start()
> >>> from weka.core.converters import Loader
> >>> loader = Loader(classname="weka.core.converters.ArffLoader")
> >>> data = loader.load_file("/Users/Soheyl/Desktop/Test/train.arff")
> >>> test = loader.load_file("/Users/Soheyl/Desktop/Test/test.arff")
> >>> data.class_is_last()
> >>> from weka.classifiers import Classifier
> >>> classifier = Classifier(classname="weka.classifiers.trees.J48", options=["-C", "0.3"])
> >>> classifier.build_classifier(data)
> >>> import weka.core.serialization as serialization
> >>> serialization.write_all("/Users/Soheyl/Desktop/Test/out.model", [classifier, Instances.template_instances(data)])
> >>> objects = serialization.read_all("/Users/Soheyl/Desktop/Test/out.model")
> >>> classifier2 = Classifier(jobject=objects[0])
> >>> data2 = Instances(jobject=objects[1])
>
>
> Now...??!!
>
> How can I use test.arff to make predictions with out.model?

Here is an example for making predictions:

https://github.com/fracpete/python-weka-wrapper-examples/blob/master/src/wekaexamples/classifiers/output_class_distribution.py
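
Along the lines of that example, here is a minimal sketch of the prediction loop. It is not the code from the linked file. The helper names `predict_rows` and `main` are hypothetical, and the sketch assumes that the classifier was saved as the first object in the model file (as in the session quoted above) and that the class attribute is the last one in test.arff.

```python
def predict_rows(classifier, dataset):
    """Return (row number, predicted label index, class distribution) per instance.

    Only relies on the classifier offering classify_instance() and
    distribution_for_instance(), as python-weka-wrapper's Classifier does.
    """
    rows = []
    for index, inst in enumerate(dataset):
        label_index = int(classifier.classify_instance(inst))
        distribution = list(classifier.distribution_for_instance(inst))
        rows.append((index + 1, label_index, distribution))
    return rows


def main(model_path, test_path):
    # Imported here so that the helper above can be used without a JVM.
    import weka.core.jvm as jvm
    import weka.core.serialization as serialization
    from weka.classifiers import Classifier
    from weka.core.converters import Loader

    jvm.start()
    try:
        # Assumption: the classifier is the first serialized object.
        objects = serialization.read_all(model_path)
        classifier = Classifier(jobject=objects[0])

        loader = Loader(classname="weka.core.converters.ArffLoader")
        test = loader.load_file(test_path)
        test.class_is_last()  # the test set needs its class attribute set too

        for number, label_index, distribution in predict_rows(classifier, test):
            label = test.class_attribute.value(label_index)
            print("%d: predicted=%s, distribution=%s" % (number, label, distribution))
    finally:
        jvm.stop()
```

With the paths from the original question, this would be called as `main("/Users/Soheyl/Desktop/Test/out.model", "/Users/Soheyl/Desktop/Test/test.arff")`.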

> Can I use the model that was created and saved with Weka in line 12?

Sure. The following example shows how to save and read back in a classifier object:

https://github.com/fracpete/python-weka-wrapper-examples/blob/master/src/wekaexamples/core/serialization.py

Cheers, Peter

Soheyl Arab

Feb 13, 2016, 6:48:56 AM

to python-weka-wrapper

Thank you so much.

I got it...

Snehal Gawade

Feb 23, 2017, 6:27:29 AM

to python-weka-wrapper

Hello Soheyl, I also want the same kind of code. Can you send me your code?

Snehal Gawade

Feb 23, 2017, 6:35:57 AM

to python-weka-wrapper

Hello Peter,
I tried the code from the links above, but I am getting errors. I created a model using "train_save_model_example.py", but when I run
"load_test_model_example.py" I get the following error:

Generating predictions for your test set...
Traceback (most recent call last):
  File "traintest.py", line 31, in <module>
    model.test()
  File "/home/snehal/wekapy.py", line 215, in test
    ob_cat = int((pred[1].split(":"))[0])
ValueError: invalid literal for int() with base 10: 'True'

Please help me with this.

I also tried the other two programs, but I get errors there as well.
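
The traceback shows int() being applied to the part before the colon in the second column of a prediction line, and that part turned out to be 'True' rather than a number. As a hedged sketch only: the helper below (`parse_indexed_label` is a hypothetical name, not part of wekapy) parses such a token strictly and reports the offending text. The "index:label" layout is an assumption taken from the failing line of code, not from verified Weka output, and the sketch does not explain why the token had a different shape here.

```python
def parse_indexed_label(token):
    """Split a token such as '2:True' into (2, 'True').

    Raises ValueError with the offending token when the text before the
    colon is missing or is not a plain integer.
    """
    index_text, separator, label = token.partition(":")
    if not separator or not index_text.isdigit():
        raise ValueError("expected 'index:label', got %r" % token)
    return int(index_text), label
```

Printing the raw output line before parsing it would show which column layout the installed Weka version actually produces.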

Peter Reutemann

Feb 23, 2017, 1:32:29 PM

to python-weka-wrapper

> I tried the code from the links above, but I am getting errors. I created
> a model using "train_save_model_example.py", but when I run
> "load_test_model_example.py" I get the following error:
>
> Generating predictions for your test set...
> Traceback (most recent call last):
>   File "traintest.py", line 31, in <module>
>     model.test()
>   File "/home/snehal/wekapy.py", line 215, in test
>     ob_cat = int((pred[1].split(":"))[0])
> ValueError: invalid literal for int() with base 10: 'True'
>
> Please help me with this.
>
> I also tried the other two programs, but I get errors there as well.

The example repository neither contains "train_save_model_example.py"
nor "load_test_model_example.py". You'd have to post the code, before
I can comment.

Cheers, Peter
--
Peter Reutemann
Dept. of Computer Science
University of Waikato, NZ
+64 (7) 858-5174
http://www.cms.waikato.ac.nz/~fracpete/
http://www.data-mining.co.nz/

Snehal Gawade

Feb 23, 2017, 1:51:06 PM

to python-we...@googlegroups.com

Hello Peter,
Sorry for the inconvenience. The program is actually available at this link:

https://github.com/flyingsparx/WekaPy

You can refer to the code from this link.

#...................train_save_model_example.py

# This example demonstrates how one might train a model and then save
# it for loading and testing with later.
from wekapy import *
# CREATE NEW MODEL INSTANCE WITH A CLASSIFIER TYPE
model = Model(classifier_type = "bayes.BayesNet")
# CREATE TRAINING INSTANCES. LAST FEATURE IS THE PREDICTION OUTCOME
instance1 = Instance()
instance1.add_feature(Feature(name="num_milkshakes",value=46,possible_values="real"))
instance1.add_feature(Feature(name="is_sunny",value=True,possible_values="{False, True}"))
instance1.add_feature(Feature(name="boys_in_yard",value=True,possible_values="{False, True}"))
instance2 = Instance()
instance2.add_feature(Feature(name="num_milkshakes",value=2,possible_values="real"))
instance2.add_feature(Feature(name="is_sunny",value=False,possible_values="{False, True}"))
instance2.add_feature(Feature(name="boys_in_yard",value=False,possible_values="{False, True}"))
model.add_train_instance(instance1)
model.add_train_instance(instance2)
# FINALLY, TRAIN AND SAVE THE TRAINED MODEL TO FILE
model.train(folds=2, save_as="/path/to/model.model")

#...............................load_test_model_example.py
# This example demonstrates loading a pre-existing trained model and using
# this to test against.
from wekapy import *
# CREATE NEW MODEL INSTANCE WITH A CLASSIFIER TYPE
model = Model(classifier_type = "bayes.BayesNet")
# LOAD A PREVIOUSLY TRAINED MODEL INTO OUR model OBJECT FOR TESTING AGAINST
model.load_model("/path/to/model.model")
# CREATE TEST INSTANCES
test_instance1 = Instance()
test_instance1.add_feature(Feature(name="num_milkshakes",value=44,possible_values="real"))
test_instance1.add_feature(Feature(name="is_sunny",value=True,possible_values="{False, True}"))
test_instance1.add_feature(Feature(name="boys_in_yard",value="?",possible_values="{False, True}"))
test_instance2 = Instance()
test_instance2.add_feature(Feature(name="num_milkshakes",value=5,possible_values="real"))
test_instance2.add_feature(Feature(name="is_sunny",value=False,possible_values="{False, True}"))
test_instance2.add_feature(Feature(name="boys_in_yard",value="?",possible_values="{False, True}"))
model.add_test_instance(test_instance1)
model.add_test_instance(test_instance2)
# FINALLY, TEST AGAINST THE LOADED MODEL
model.test()
# CHECK THE PREDICTIONS:
predictions = model.predictions
for prediction in predictions:
    print prediction

#................................................wekapy.py....................................

import subprocess
import os
import sys
import traceback
import time
import uuid
# Prediction class
#
# Used internally and externally to WekaPy to represent a Prediction made as
# a result of running test data through a trained classifier.
# Each prediction effectively represents the classification of a set of instances.
class Prediction:
    def __init__(self, index, observed_1, observed_2, pred_1, pred_2, error, prob):
        self.index = int(index)
        self.observed_category = int(observed_1)
        self.observed_value = observed_2
        self.predicted_category = int(pred_1)
        self.predicted_value = pred_2
        self.error = bool(error)
        self.probability = float(prob)
    def __str__(self):
        return_s = str(self.index)+":\t"
        return_s = return_s+"observed: "+str(self.observed_value)+"\tpredicted: "+str(self.predicted_value)+"\tprob: "+str(self.probability)
        return return_s
# Feature class
#
# Used internally and externally to represent a feature of data.
# Each feature should contain a name and a value (for example, name = 'daylight_hours', value = 10)
# possible_values should be represented by a String type object indicating the possible feature values
# e.g. real, {true, false}, {0,1,2}, {tom, dick, harry}, etc.
class Feature:
    def __init__(self, name = None, value = None, possible_values=None):
        self.name = name
        self.value = value
        self.possible_values = possible_values
# Instance class
#
# Used internally and externally to represent a set of Feature objects.
# Essentially, an Instance object just maintains a list of Features.
class Instance:
    def __init__(self, features = None):
        self.features = features
        if features == None:
            self.features = []
    def add_feature(self, feature):
        if isinstance(feature, Feature):
            self.features.append(feature)
        else:
            raise WekapyException("Argument 'feature' must be of type Feature.")
# Model class
#
# Used externally, and is the main class for use with this library.
# The Model class should be instantiated as the first stage, from which it can be trained
# and/or tested.
# Instantiate with a classifier_type (and any optional arguments)
class Model:
    def __init__(self, classifier_type = None, max_memory = 1500, verbose = True):
        if classifier_type == None or not isinstance(classifier_type, str):
            raise WekapyException("A classifier type is required for construction.")
            return False
        if not isinstance(max_memory, int):
            raise WekapyException("'max_memory' argument must be of type (int).")
            return False
        self.id = uuid.uuid4()
        self.model_dir = "wekapy_data/models"
        self.arff_dir = "wekapy_data/arff"
        self.classifier = classifier_type
        self.max_memory = max_memory
        self.training_instances = []
        self.testing_instances = []
        self.predictions = []
        self.verbose = verbose
        self.trained = False
        if not os.path.exists(self.model_dir):
            os.makedirs(self.model_dir)
        if not os.path.exists(self.arff_dir):
            os.makedirs(self.arff_dir)
    # Generate an ARFF file from a list of instances
    def create_ARFF(self,instances, type):
        output_arff = open(self.arff_dir+"/"+str(self.id)+"-"+type+".arff", "w")
        output_arff.write("@relation "+str(self.id)+"\n")
        for i, instance in enumerate(instances):
            if i == 0:
                for feature in instance.features:
                    output_arff.write("\t@attribute "+feature.name+" "+str(feature.possible_values)+"\n")
                output_arff.write("\n@data\n")
            strToWrite = ""
            for j, feature in enumerate(instance.features):
                if j == 0:
                    strToWrite = strToWrite + str(feature.value)
                else:
                    strToWrite = strToWrite + "," + str(feature.value)
            output_arff.write(strToWrite+"\n")
        output_arff.close()
        if type == "training":
            self.training_file = self.arff_dir+"/"+str(self.id)+"-"+type+".arff"
        if type == "test":
            self.test_file = self.arff_dir+"/"+str(self.id)+"-"+type+".arff"
    # Load a model, if it exists, and set this as the currently trained model for this
    # Model instance.
    def load_model(self, model_file):
        if os.path.exists(model_file):
            self.model_file = model_file
            self.trained = True
        else:
            raise WekapyException("Your model could not be found.")
    # Add a training instance to the model.
    def add_train_instance(self, instance):
        if isinstance(instance, Instance):
            self.training_instances.append(instance)
        else:
            raise WekapyException("Argument 'instance' must be of type Instance.")
    # Add a testing instance to the model.
    def add_test_instance(self, instance):
        if isinstance(instance, Instance):
            self.testing_instances.append(instance)
        else:
            raise WekapyException("Argument 'instance' must be of type Instance.")
    # Train the model with the chosen classifier from features in an ARFF file
    def train(self, training_file = None, instances = None, save_as = None, folds = 10):
        if self.verbose: print "Training your classifier..."
        start_time = time.time()
        if save_as == None:
            save_as = self.model_dir+"/"+str(self.id)+".model"
        if len(self.training_instances) == 0: # if add_train_instance not called:
            if training_file == None and instances == None:
                raise WekapyException("Please provide some train instances either by naming an ARFF train_set, providing a list of Instances, or calling add_train_instance().")
            if training_file == None:
                self.create_ARFF(instances, "training")
            if instances == None:
                self.training_file = training_file
        if len(self.training_instances) > 0: # if add_train_instance called:
            if training_file == None and instances == None:
                self.create_ARFF(self.training_instances, "training")
            # Prioritise adding features passed at calltime
            if training_file == None and instances is not None:
                self.create_ARFF(instances, "training")
            # Prioritise ARFF file passed at calltime
            if instances == None and training_file is not None:
                self.training_file = training_file
        self.model_file = save_as
        process = subprocess.Popen(["java", "-Xmx"+str(self.max_memory)+"M",
                                    "weka.classifiers."+self.classifier, "-x", str(folds), "-t",
                                    self.training_file, "-d", save_as],
                                   stdout=subprocess.PIPE, stderr=subprocess.PIPE)
        process_output, process_error = process.communicate()
        if "Exception" in process_error:
            for line in process_error.split("\n"):
                if "Exception" in line:
                    raise WekapyException(line.split(' ',1)[1])
        end_time = time.time()
        self.trained = True
        if self.verbose: print "Training complete (time taken = %.2fs)." % (end_time-start_time)
    # Generate predictions from the trained model from test features in an ARFF file
    def test(self, test_file = None, instances = None, model_file = None):
        if self.verbose: print "Generating predictions for your test set..."
        start_time = time.time()
        if not model_file == None:
            self.load_model(model_file)
        if not self.trained:
            raise WekapyException("The classifier has not yet been trained. Please call train() first")
        if len(self.testing_instances) == 0:
            if test_file == None and instances == None:
                raise WekapyException("Please provide some test instances either by naming an ARFF test_set, providing a list of Instances, or calling add_test_instance().")
            if test_file == None:
                self.create_ARFF(instances, "test")
            if instances == None:
                self.test_file = test_file
        if len(self.testing_instances) > 0:
            if test_file == None and instances == None:
                self.create_ARFF(self.testing_instances, "test")
            if test_file == None and instances is not None:
                self.create_ARFF(instances, "test")
            if instances == None and test_file is not None:
                self.test_file = test_file
        process = subprocess.Popen(["java", "-Xmx"+str(self.max_memory)+"M",
                                    "weka.classifiers."+self.classifier, "-T", self.test_file, "-l",
                                    self.model_file, "-p", "0"],
                                   stdout=subprocess.PIPE, stderr=subprocess.PIPE)
        output, process_error = process.communicate()
        if "Exception" in process_error:
            for line in process_error.split("\n"):
                if "Exception" in line:
                    raise WekapyException(line.split(' ',1)[1])
        lines = output.split("\n")
        instance_predictions = []
        for line in lines:
            pred = line.split()
            if len(pred) >= 4 and not pred[0].startswith("=") and not pred[0].startswith("inst"):
                index = int(pred[0])
                ob_cat = int((pred[1].split(":"))[0])
                ob_val = str((pred[1].split(":"))[1])
                p_cat = int((pred[2].split(":"))[0])
                p_val = str((pred[2].split(":"))[1])
                error = False
                prob = 0.0
                if "+" in pred[3]:
                    error = True
                    prob = float(pred[4])
                else:
                    prob = float(pred[3])
                prediction = Prediction(index, ob_cat, ob_val, p_cat, p_val, error, prob)
                instance_predictions.append(prediction)
        self.predictions = instance_predictions
        end_time = time.time()
        if self.verbose: print "Testing complete (time taken = %.2fs)." % (end_time-start_time)
        return instance_predictions
class WekapyException(Exception):
    def __init__(self, message):
        self.message = message
    def __str__(self):
        return self.message
Regards,
Snehal Gawade
M.Tech(Computer Science and Engineering)
NIT,Goa

> --
> You received this message because you are subscribed to a topic in the Google Groups "python-weka-wrapper" group.
> To unsubscribe from this topic, visit https://groups.google.com/d/topic/python-weka-wrapper/jA-nwM8BXT0/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to python-weka-wra...@googlegroups.com.
> To post to this group, send email to python-we...@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/python-weka-wrapper/CAHoQ12L25-ZE0NfPdF2XxBhXKcQ3XhK7%3DRvT9N3XXd6ztz62_w%40mail.gmail.com.
> For more options, visit https://groups.google.com/d/optout.

Snehal Gawade

Feb 26, 2017, 2:07:10 AM

to python-we...@googlegroups.com, frac...@waikato.ac.nz

Hello Peter,
Greetings for the day!
Please help me with this issue. I really need this code and have a
deadline in 3 days. I am trying hard but still not getting
results.

I am working on intrusion detection. I want to train a OneR
classifier with udptrain.csv and test it against udptest.csv.
I am attaching both files. Please help me with this if
possible.

I will be highly obliged.

Thank you!


udptest.csv

udptrain.csv

Soheyl Arab

Feb 27, 2017, 3:15:57 AM

to python-weka-wrapper, frac...@waikato.ac.nz

Hi Snehal,

I faced that problem in the past and cannot remember my solution, but I have some source code that I used for prediction with an exported model. I hope it is useful.

Best regards.


PredictWithModel.py

Peter Reutemann

Feb 27, 2017, 4:13:57 PM

to python-weka-wrapper

Hi there

You're not actually using the python-weka-wrapper module, but your own
code. Sorry, but I don't provide support for that.

All the things that you're trying to achieve are already available
through the python-weka-wrapper library, which you can simply install
via Python's pip. See the examples on github:
- Python2.7
https://github.com/fracpete/python-weka-wrapper-examples
- Python3
https://github.com/fracpete/python-weka-wrapper3-examples

Soheyl's code uses the python-weka-wrapper library.
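
For the OneR request earlier in the thread (train on udptrain.csv, test on udptest.csv), a minimal python-weka-wrapper sketch could look like the following. The function name `train_and_test_oner` is hypothetical, and the sketch assumes that both CSV files have the same columns and that the class is a nominal attribute in the last column. Neither assumption has been checked against the attached files. Two CSV files loaded separately can also end up with different nominal label sets, in which case Weka may reject the test set as incompatible.

```python
def train_and_test_oner(train_csv, test_csv):
    """Train OneR on one CSV file and return the evaluation summary for another.

    Assumption: same columns in both files, nominal class in the last column.
    """
    # Imported here so the function can be defined without a JVM available.
    import weka.core.jvm as jvm
    from weka.classifiers import Classifier, Evaluation
    from weka.core.converters import Loader

    jvm.start()
    try:
        # One loader per file, to avoid relying on loader reuse.
        train = Loader(classname="weka.core.converters.CSVLoader").load_file(train_csv)
        test = Loader(classname="weka.core.converters.CSVLoader").load_file(test_csv)
        train.class_is_last()
        test.class_is_last()

        classifier = Classifier(classname="weka.classifiers.rules.OneR")
        classifier.build_classifier(train)

        # Evaluate the trained classifier on the separate test set.
        evaluation = Evaluation(train)
        evaluation.test_model(classifier, test)
        return evaluation.summary()
    finally:
        jvm.stop()
```

Calling `print(train_and_test_oner("udptrain.csv", "udptest.csv"))` would print the evaluation summary. Because the JVM is stopped at the end and cannot be restarted in the same process, the function is meant to be called once per script run.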

Cheers, Peter

