I am new to data science & machine learning.
I am using Weka platform to work on a classification problem with an imbalanced dataset. I want to apply the following:
1) a cross-validation technique to the imbalanced dataset.
2) a feature selection method on the same dataset but after balancing it using oversampling technique.
Is it correct to do the above by this scenario:
Assume that I had an imbalanced dataset with 5 features: a, b, c, d, and e features. I balanced the dataset using oversampling technique. Then I applied a feature selection method on the entire dataset. Then, I got three selected features: a, b, and c. After that, I went back to the imbalanced dataset (the original one) and removed d and e features. Then I completed my procedures on the imbalanced dataset with a, b, and c features (using FilteredClassifier and MultiFilter to apply cross-validation + oversampling + a classifier)