Failed data processing

Josh

Dec 28, 2012, 2:45:59 PM
to mlc...@googlegroups.com
Hello,
I have just started with mlcomp.org, and I'm running into an issue with the first processing step for the data (Inspect Split Strip). I have some continuous data sets. There are 57 lines in the training set and 5186 in the test set. I've used an SVM for binary classification using R, but I'd like to see what some other algorithms can achieve with mlcomp. Does anyone have some advice on what I can fix?
Thanks!

=== START program2: ./run inspect ../dataset1/train
Traceback (most recent call last):
  File "./run", line 89, in <module>
    inspect(sys.argv[2])
  File "./run", line 31, in inspect
    y = int(y)
ValueError: invalid literal for int() with base 10: '-1,1:0.0677988,2:9.10265,3:0.015101,4:4.00298'
=== END program2: ./run inspect ../dataset1/train --- FAILED [0s]
Failed: Inspecting train datashard failed.

head train:
-1,1:0.0677988,2:9.10265,3:0.015101,4:4.00298
-1,1:39.3518,2:55.8266,3:54.1971,4:306.397
1,1:3.00313,2:0.311742,3:29.0759,4:5.59234
-1,1:25.8011,2:6.47332,3:0,4:0.0975788
-1,1:7.76623,2:62.5829,3:0,4:0
-1,1:3.78907,2:2.14078,3:1.08095,4:12.655
1,1:46.7778,2:0.577116,3:0.122145,4:1.02428
-1,1:1.14223,2:5.76031,3:2.64855,4:23.7908
1,1:713.41,2:179.671,3:543.475,4:271.076
1,1:1.11077,2:0.137706,3:12.7526,4:0.146553

head test:
1:0.0920242,2:0.570197,3:1.85306,4:0.593986
1:0,2:0,3:8.21495,4:19.0214
1:1.22572,2:0,3:0.977919,4:0.148526
1:0,2:0.0701989,3:0,4:4.59519
1:0.323363,2:0.805351,3:0,4:1.19509
1:0,2:7.28946,3:0,4:0.843023
1:0,2:49.8327,3:0,4:0
1:0,2:0.12655,3:0,4:8.82058
1:2.51452,2:0.4295,3:2.77731,4:2.86104
1:0.0107031,2:2.61035,3:0.00843482,4:4.13331

Percy Liang

Dec 28, 2012, 3:52:43 PM
to mlc...@googlegroups.com
Your file format is incorrect. Use spaces rather than commas.

-Percy
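
For anyone hitting the same error, here is a minimal sketch of the fix suggested above, assuming the only problem is the comma separators. The function names (to_space_separated, parse_label, convert_file) are made up for illustration and are not part of mlcomp. parse_label only imitates what the traceback suggests the inspect step does (take the first whitespace-separated token and pass it to int()); the real ./run script may differ.

```python
def to_space_separated(line: str) -> str:
    """Replace the comma separators in one data line with single spaces."""
    return " ".join(field.strip() for field in line.strip().split(","))


def parse_label(line: str) -> int:
    """Hypothetical stand-in for the label parsing seen in the traceback.

    Assumption: the first whitespace-separated token is treated as the label.
    With comma-separated input the whole line is one token, so int() fails.
    """
    return int(line.split()[0])


def convert_file(src_path: str, dst_path: str) -> None:
    """Write a space-separated copy of a comma-separated data file."""
    with open(src_path) as src, open(dst_path, "w") as dst:
        for line in src:
            if line.strip():  # skip blank lines
                dst.write(to_space_separated(line) + "\n")


if __name__ == "__main__":
    raw = "-1,1:0.0677988,2:9.10265,3:0.015101,4:4.00298"
    fixed = to_space_separated(raw)
    print(fixed)
    print(parse_label(fixed))
```

Running convert_file("train", "train.fixed") would produce a file whose first token on each line is just the label. Note that the test lines shown in the original post have no leading label at all; this sketch does not change that, and the thread does not say whether that matters.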