Cancer Classification using Data Mining Applications | ||||
The International Undergraduate Research Conference | ||||
Article 24, Volume 2, Second International Undergraduate Research Conference, IUGRC, 2017, Page 129-129 PDF (510.13 K) | ||||
Document Type: Original Article | ||||
DOI: 10.21608/iugrc.2017.90641 | ||||
View on SCiNiTO | ||||
Authors | ||||
Ahmed Ramadan Youssef1; Ahmed Adel Ragab1; Aya Mohamed Sadek1; Shima Moustafa Korany1; Abdelrahman Gamal Youssef1; Abdelrahman Ibrahim1; Rania Ahmed Abdel Azeem Abul Seoud2; Dina Ahmed Salem3 | ||||
1Fayoum University, Egypt. | ||||
2Department of Electrical Engineering, Communication and Electronics Section; Faculty of Engineering, Fayoum University; Fayoum, Egypt. | ||||
3Dept. of Computer Eng.-Faculty of Eng. Misr University for Science and Technology, 6th of October, Giza, Egypt. | ||||
Abstract | ||||
The correct interpretation of the biological data is the main goal of Bioinformatics. One emerging and reliable source of data is the microarray technology which is considered a breakthrough in Bioinformatics. Cancer classification using microarray data is a challenge due to the enormous number of features compared to the samples. In the current work, an algorithm was developed in order to classify cancer samples. The developed algorithm was conducted on two steps. In the first step, the feature selection technique was applied on the data to eliminate any undesired features of little or no predictive information. The feature selection technique was based on Entropy and F-score measurements. Then, the classification process was performed using linear support vector machine (SVM), K-Nearest Neighbor (KNN) and Naive Bayes (NB) algorithms, the results achieved were 100% using Naive Bayes, 97% using Linear SVM and 94% using KNN on leukemia dataset .The ability of the developed algorithm for classifying the samples was practically examined using leukemia microarray dataset. The results showed that the developed algorithm could detect and classify all the samples. Then we generalized the algorithm to be applied on different microarray datasets such as Prostate and Colon. | ||||
Statistics Article View: 141 PDF Download: 85 |
||||