UDK 519.87 Vestnik SibGAU 2014, No. 3(55), P. 16–20
DATA MINING TOOLS FOR PROSPECTIVE STUDENTS’ SUCCESS RATE PREDICTION
Sh. A. Akhmedova, S. R. Vishnevskaya, A. A. Koromyslova
Siberian State Aerospace University named after academician M. F. Reshetnev 31, Krasnoyarsky Rabochy Av., Krasnoyarsk, 660014, Russian Federation E-mail: shahnaz@inbox.ru, akoromyslova@mail.ru, vishni@ngs.ru
The aim of that study was prospective students’ success rate prediction in case if they are enrolled at university. For this purpose different classification and prediction methods were used namely support vector machines (SVM), neural networks and fuzzy systems which were obtained by using genetic and bionic algorithms and their modifications. Firstly, meta-heuristic called Co-Operation of Biology Related Algorithms (COBRA), that has earlier demonstrated its usefulness on CEC’2013 real-valued optimization competition benchmark and its modification for solving constrained optimization problems, was applied to SVM-based classifiers design. Then genetic algorithm was used for neural networks and fuzzy logic systems automatic generation. Various benchmark classification problems were solved with those approaches. It was established that support vector machines, neural networks and fuzzy logic systems developed in that way outperform many alternative methods on mentioned benchmark classification problems. So the workability and usefulness of proposed classification or prediction algorithms were confirmed. After that for solving of prospective students’ success rate prediction problem, in case if they are enrolled at university, information about them obtained during the operational period of Admission Committee was gathered and preprocessed. Eventually it was established which kind of information about a prospective student is enough for determining whether he or she will or will not pass end of first semester exams in case of enrollment. Moreover, it should be noted that support vector machines generated by collective optimization method COBRA have shown the best results.
fresh students’ success rate, support vector machines, neural networks, fuzzy systems, genetic algorithms, bionic algorithms.
References

1. Bishop C. M. Theoretical foundation of neural networks. Proceedings of Physics Computing. 1996, p. 500–507.

2. Vapnik V., Chervonenkis A. Teoriya raspoznovaniya obrazov [The Theory of Pattern Recognition]. Moscow, Nauka Publ., 1974, 415 p.

3. Kruglov V. V., Dlee M. I., Golunov R. Y. Nechetkaya logika I iskusstvennye neironnye seti [Fuzzy logic and artificial neural networks]. Moscow, Phismatlit Publ., 2001, 224 p.

4. Akhmedova Sh., Shabalov A. Development and Investigation of Bio-logically Inspired Algorithms Cooperation Metaheuristic. Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO’13). 2013, p. 1417–1418.

5. Gumennikova A. V., Emel'janova M. N., Semenkin E. S., Sopov E. A. [About evolutionary algorithms for solving hard optimization problems]. Vestnik SibGAU. 2003, no. 4, p. 14–23. (In Russ.)

6. Kennedy J., Eberhart R. Particle Swarm Optimization. Proceedings of IEEE International Conference on Neural Networks. 1995, vol. IV, p. 1942–1948.

7. Yang Ch., Tu X., Chen J. Algorithm of Marriage in Honey Bees Optimization Based on the Wolf Pack Search. Proceedings of the International Conference on Intelligent Pervasive Computing. 2007, p. 462–467.

8. Yang X. S. Firefly algorithms for multimodal optimization. Proceedings of the 5th Symposium on Stochastic Algorithms, Foundations and Applications. 2009,
p. 169–178.

9. Yang X. S., Deb S. Cuckoo Search via Levy flights. Proceedings of the World Congress on Nature & Biologically Inspired Computing. 2010, p. 210–214.

10. Yang X. S. A new metaheuristic bat-inspired algorithm. Nature Inspired Cooperative Strategies for Optimization, Studies in Computational Intelligence. 2010,
vol. 284, p. 65–74.

11. Akhmedova Sh., Semenkin E. Co-Operation of Biology Related Algorithms. Proceedings of the IEEE Congress on Evolutionary Computation (CEC’2013). 2013, p. 2207–2214.

12. Semenkina M. E. [Effectiveness investigation of adaptive evolutionary algorithms for data mining information technology design]. Iskusstvennyj intellekt i prinjatie reshenij. 2013, no. 1, p. 13–23. (In Russ.)

13. Semenkin E. S., Shabalov A. A. [The system of automatic design of intelligent information technologies ensembles for data mining problems]. Programmnye produkty i sistemy. 2012, no. 4 (100), p. 70–73. (In Russ.)

14. Akhmedova Sh. A., Semenkin E. S. [New optimization metaheurastic based on co-operation of biology related algorithms]. Vestnik SibGAU. 2013, no. 4 (50), p. 92–99. (In Russ.)

15. Akhmedova Sh., Semenkin E., Sergienko R. Automatically Generated Classifiers for Opinion Mining with Different Term Weighting Schemes. Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics (ICINCO’2014). 2014,
vol. 2, p. 845–850.

16. Electronic textbook StatSoft. Available at: http:// www.fmi.uni-sofia.bg/fmi/statist/education/textbook/eng/ glosa.html.


Akhmedova Shakhnaz Agasuvar kyzy – Master’s Degree student, junior research fellow, Siberian State Aerospace University. E-mail: shahnaz@inbox.ru.

Vishnevskaya Sofya Romanovna – Candidate of Engineering Science, associate professor, Head of Higher Mathematics Department, Siberian State Aerospace University. E-mail: vishni@ngs.ru.

Koromyslova Alexandra Andreevna – student, Siberian State Aerospace University. E-mail: akoromyslova@mail.ru.