Penanganan Data Tidak Seimbang pada Pemodelan Rotation Forest Keberhasilan Studi Mahasiswa Program Magister IPB
Graduate school of Bogor Agricultural University (SPs-IPB) stated that not all students of IPB master program successfully complete their studies. This becomes an evaluation for IPB to be more selective in choosing students in the future. This study aims to model the success classification of IPB master students in 2011 to 2015. The classification method used is rotation forest. The percentage of students who graduated is very large compared to those who did not pass, this can cause the evaluation value different. SMOTE (Synthetic Minority Oversampling Technique) is one of method to handle such unbalanced data by generating artificial data. The ROC (Receiver Operating Characteristic) curve is built to see the optimum cut off value. There are two classification models, they are rotation forest models before and after handled by SMOTE. The comparison results show that the rotation forest model after SMOTE with cut off value 0.6 is the best model. This model can increase the sensitivity value more than 50% although the accuracy and specificity value decreased compared to the modeling before SMOTE.