Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Advertisement

Improving the genetic bee colony optimization algorithm for efficient gene selection in microarray data

  • Regular Paper
  • Published:
Progress in Artificial Intelligence Aims and scope Submit manuscript

Abstract

Feature selection is a very critical component in the workflow of biomedical data mining applications. In particular, there is a need for feature selection methods that can find complex relationships among genes, yet computationally efficient. Within the scope of microarray data analysis, the genetic bee colony (Gbc) algorithm is one of the best feature selection algorithms, which leverages the combination between genetic and ant colony optimization algorithms to search for the optimal solution. In this paper, we analyse in depth the fundamentals lying behind the Gbc and propose some improvements in both efficiency and accuracy, so that researchers can even take more advantage of this excellent method. By (i) replacing the filtering phase of Gbc with a more efficient technique, (ii) improving the population generation in the artificial colony algorithm used in Gbc, and (iii) improving the exploitation method in Gbc, our experiments in microarray data sets reveal that our new method Gbc+ is not only significantly more accurate, but also around ten times faster on average than the original

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Explore related subjects

Find the latest articles, discoveries, and news in related topics.

References

  1. Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157 (2003)

    MATH  Google Scholar 

  2. Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artif. Intell. 97(1), 273 (1997)

    Article  Google Scholar 

  3. Alshamlan, H.M., Badr, G.H., Alohali, Y.A.: Genetic bee colony (GBC) algorithm: A new gene selection method for microarray cancer classification. Comput. Biol. Chem. 56, 49 (2015)

    Article  Google Scholar 

  4. Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1226 (2005)

    Article  Google Scholar 

  5. Ramrez-Gallego, S., Lastra, I., Martnez-Rego, D., Boln-Canedo, V., Bentez, J.M., Herrera, F., AlonsoBetanzos, A.: FastmRMR: fast minimum redundancy maximum relevance algorithm for high-dimensional big data. Int. J. Intell. Syst. 32, 134–152 (2017). https://doi.org/10.1002/int.21833

    Article  Google Scholar 

  6. Pino, A., Shin, K.: Mrmr+ and Cfs+ feature selection algorithms for high-dimensional data. submitted to Applied Intelligence (2018) (under review)

  7. Xiang, W.L., An, M.Q.: An efficient and robust artificial bee colony algorithm for numerical optimization. Comput. Oper. Res. 40(5), 1256 (2013)

    Article  MathSciNet  Google Scholar 

  8. Karaboga, D., Akay, B.: A comparative study of artificial bee colony algorithm. Appl. Math. Comput. 214(1), 108 (2009)

    MathSciNet  MATH  Google Scholar 

  9. Wojnarski, M., Janusz, A., Nguyen, H.S., Bazan, J., Luo, C., Chen, Z., Hu, F., Wang, G., Guan, L., Luo, H., Gao, J., Shen, Y., Nikulin, V., Huang, T.H., McLachlan, G.J., Bošnjak, M., Gamberger, D.: RSCTC’2010 discovery challenge: mining DNA microarray data for medical diagnosis and treatment. In: Szczuka, M., Kryszkiewicz, M., Ramanna, S., Jensen, R., Hu, Q. (eds.) Rough Sets and Current Trends in Computing. 7th International Conference, RSCTC 2010, Warsaw, Poland, June 28–30, 2010 Proceedings, pp. 4–19. Springer, Berlin (2010)

    Chapter  Google Scholar 

  10. El Akadi, A., Amine, A., El Ouardighi, A., Aboutajdine, D.: A two-stage gene selection scheme utilizing MRMR filter and GA wrapper. Knowl. Inf. Syst. 26(3), 487 (2011)

    Article  Google Scholar 

  11. Osama, A., Tajudin, A., Azmi, M., Mohammad, L.: Gene selection for cancer classification by combining minimum redundancy maximum relevancy and bat-inspired algorithm. Int. J. Data Min. Bioinf. 19(1), 32 (2017)

    Article  Google Scholar 

  12. Moraglio, A., Di Chio, C., Poli, R.: Geometric particle swarm optimization. In: Ebner, M., O’Neill, M., Ekárt, A., Vanneschi, L., Esparcia-Alcázar, A.I. (eds.) Genetic Programming, 10th European Conference, EuroGP 2007, Valencia, Spain, April 11–13, 2007, Proceedings. Lecture Notes in Computer Science, vol. 5481, pp. 125–136. Springer, Berlin (2007)

    Google Scholar 

  13. Witten, I., Frank, E., Hall, M., Pal, C.: Data Mining: Practical Machine Learning Tools and Techniques. The Morgan Kaufmann Series in Data Management Systems. Elsevier, New York (2016)

    Google Scholar 

  14. Vanschoren, J., van Rijn, J.N., Bischl, B., Torgo, L.: OpenML: networked science in machine learning. SIGKDD Explor. 15(2), 49 (2013)

    Article  Google Scholar 

  15. Wojnarski, M., Stawicki, S., Wojnarowski, P.: TunedIT.org: system for automated evaluation of algorithms in repeatable experiments. In: Szczuka, M., Kryszkiewicz, M., Jensen, R., Hu, Q. (eds.) Rough Sets and Current Trends in Computing, 7th International Conference, RSCTC 2010, Warsaw, Poland, June 28-30, 2010 Proceedings, pp. 12 (2011)

Download references

Acknowledgements

This work was partially supported by the Grant-in-Aid for Scientific Research (JSPS KAKENHI Grant Number 17H00762) from the Japan Society for the Promotion of Science.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adrian Pino Angulo.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pino Angulo, A., Shin, K. & Velázquez-Rodríguez, C. Improving the genetic bee colony optimization algorithm for efficient gene selection in microarray data. Prog Artif Intell 7, 399–410 (2018). https://doi.org/10.1007/s13748-018-0161-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13748-018-0161-9

Keywords