Gene-Disease Association
Mansoura Journal for Computer and Information Sciences
Volume 16, Issue 2, December 2020, Pages 1-9
Document Type: Original Research Articles.
DOI: 10.21608/mjcis.2020.321071
Authors
E. E. Abdelbadeea; M. A. El-Dosuky; M. Z. Rashad
Computer Science Dept., Faculty of Computers and Information, Mansoura University, Egypt
Abstract
Disease susceptibility prediction is defined as follows. Given a training set S and a test case t ∉ S in the form of a tuple (known SNP, unknown disease), the task is to predict the unknown disease with maximum accuracy. DisGeNET is a prominent dataset in disease susceptibility research. This paper reviews the comprehensive information in DisGeNET before introducing a proposed system that operates on top of it. First, the dataset is vetted by consolidation and by removing genes with effects beyond a certain threshold. Second, the empirical cumulative distribution function is computed and used to plot and print gene associations for many diseases, including but not limited to Alzheimer's disease, anemia, and brain and breast cancer. The proposed methods, such as applying C4.5 and naïve Bayes, give better accuracy than previous works.
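The two preprocessing steps named in the abstract (threshold-based vetting and the empirical cumulative distribution function) can be illustrated with a minimal sketch. It is not the authors' implementation: the record layout, the gene names, the score values, the function names `filter_by_threshold` and `ecdf`, and the direction of the cutoff (keeping scores at or below `max_score`) are all assumptions made for illustration.

```python
from bisect import bisect_right


def filter_by_threshold(associations, max_score):
    """Keep records whose score is at or below max_score.

    Assumption: "beyond a certain threshold" is read here as "above it";
    the abstract does not state the direction or the value of the cutoff.
    """
    return [(gene, disease, score)
            for gene, disease, score in associations
            if score <= max_score]


def ecdf(values):
    """Return F, where F(x) is the fraction of values that are <= x."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("ecdf needs at least one value")

    def F(x):
        # bisect_right counts how many sorted values are <= x
        return bisect_right(ordered, x) / n

    return F


# Hypothetical toy records: (gene, disease, association score).
records = [
    ("GENE_A", "Alzheimer", 0.10),
    ("GENE_B", "Alzheimer", 0.30),
    ("GENE_C", "Anemia", 0.30),
    ("GENE_D", "Anemia", 0.90),
]

kept = filter_by_threshold(records, max_score=0.5)
F = ecdf([score for _, _, score in kept])
```

Calling `F(x)` then gives the share of retained associations with a score no greater than `x`, which is the quantity one would plot per disease.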
Keywords
DNA analysis; epidemiological; DisGeNET; DNA disease susceptibility; disease susceptibility prediction