Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1830761.1830826acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
short-paper

Classification of tumor marker values using heuristic data mining methods

Published: 07 July 2010 Publication History
  • Get Citation Alerts
  • Abstract

    Tumor markers are substances that are found in blood, urine, or body tissues and that are used as indicators for tumors; elevated tumor marker values can indicate the presence of cancer, but there can also be other causes. We have used a medical database compiled at the blood laboratory of the General Hospital Linz, Austria: Several blood values of thousands of patients are available as well as several tumor markers. We have used several data based modeling approaches for identifying mathematical models for estimating selected tumor marker values on the basis of routinely available blood values; in detail, estimators for the tumor markers AFP, CA-125, CA15-3, CEA, CYFRA, and PSA have been identified and are analyzed in this paper. The documented tumor marker values are classified as "normal" or "elevated"; our goal is to design classifiers for the respective binary classification problems. As we show in the results section, for those medical modeling tasks described here, genetic programming performs best among those techniques that are able to identify nonlinearities; we also see that GP results show less overfitting than those produced using other methods.

    References

    [1]
    M. Affenzeller and S. Wagner. SASEGASA: A new generic parallel evolutionary algorithm for achieving highest quality results. Journal of Heuristics -- Special Issue on New Advances on Parallel Meta-Heuristics for Complex Problems, 10:239--263, 2004.
    [2]
    M. Affenzeller and S. Wagner. Offspring selection: A new self-adaptive selection scheme for genetic algorithms. In B. Ribeiro, R. F. Albrecht, A. Dobnikar, D. W. Pearson, and N. C. Steele, editors, Adaptive and Natural Computing Algorithms, Springer Computer Science, pages 218--221. Springer, 2005.
    [3]
    M. Affenzeller, S. Wagner, and S. Winkler. Goal-oriented preservation of essential genetic information by offspring selection. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO) 2005, volume 2, pages 1595--1596. Association for Computing Machinery (ACM), 2005.
    [4]
    M. Affenzeller, S. Winkler, S. Wagner, and A. Beham. Genetic Algorithms and Genetic Programming -- Modern Concepts and Practical Applications. Chapman & Hall / CRC, 2009.
    [5]
    G. L. Andriole, E. D. Crawford, R. L. Grubband, S. S. Buys, D. Chia, T. R. Church, et al. Mortality results from a randomized prostate-cancer screening trial. New England Journal of Medicine, 360(13):1310--1319, 2009.
    [6]
    W. Banzhaf and C. Lasarczyk. Genetic programming of an algorithmic chemistry. In U. O'Reilly, T. Yu, R. Riolo, and B. Worzel, editors, Genetic Programming Theory and Practice II, pages 175--190. Ann Arbor, 2004.
    [7]
    N. Bitterlich and J. Schneider. Cut-off-independent tumour marker evaluation using roc approximation. Anticancer Research, 27:4305--4310, 2007.
    [8]
    C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
    [9]
    R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. Wiley Interscience, 2nd edition, 2000.
    [10]
    M. J. Duffy and J. Crown. A personalized approach to cancer treatment: how biomarkers can help. Clinical Chemistry, 54(11):1770--1779, 2008.
    [11]
    P. Gill, W. Murray, and M. Wright. Practical Optimization. Academic Press, 1982.
    [12]
    P. Gold and S. O. Freedman. Demonstration of tumor-specific antigens in human colonic carcinomata by immunological tolerance and absorption techniques. The Journal of Experimental Medicine, 121:439--462, 1965.
    [13]
    S. Hammarstrom. The carcinoembryonic antigen (cea) family: structures, suggested functions and expression in normal and malignant tissues. Seminars in Cancer Biology, 9:67--81, 1999.
    [14]
    J. A. Koepke. Molecular marker test standardization. Cancer, 69:1578--1581, 1992.
    [15]
    R. Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. pages 1137--1143. Morgan Kaufmann, 1995.
    [16]
    J. R. Koza. Genetic Programming: On the Programming of Computers by Means of Natural Selection. The MIT Press, 1992.
    [17]
    M. LaFleur-Brooks. Exploring Medical Language: A Student-Directed Approach. St. Louis, Missouri, USA: Mosby Elsevier, 7th edition, 2008.
    [18]
    R. S. Lai, C. C. Chen, P. C. Lee, and J. Y. Lu. Evaluation of cytokeratin 19 fragment (cyfra 21-1) as a tumor marker in malignant pleural effusion. Japanese Journal of Clinical Oncology, 29(9):421--424, 199.
    [19]
    L. Ljung. System Identification -- Theory For the User, 2nd edition. PTR Prentice Hall, Upper Saddle River, N.J., 1999.
    [20]
    G. J. Mizejewski. Alpha-fetoprotein structure and function: relevance to isoforms, epitopes, and conformational variants. Experimental biology and medicine, 226(5):377--408, 2001.
    [21]
    K. Morik, M. Imhoff, P. Brockhausen, T. Joachims, and U. Gather. Knowledge discovery and knowledge validation in intensive care. Artificial Intelligence in Medicine, 19:225--249, 2000.
    [22]
    O. Nelles. Nonlinear System Identification. Springer Verlag, Berlin Heidelberg New York, 2001.
    [23]
    Y. Niv. Muc1 and colorectal cancer pathophysiology considerations. World Journal of Gastroenterology, 14(14):2139--2141, 2008.
    [24]
    N. Osman, N. O'Leary, E. Mulcahy, N. Barrett, F. Wallis, K. Hickey, and R. Gupta. Correlation of serum ca125 with stage, grade and survival of patients with epithelial ovarian cancer at a single centre. Irish Medical Journal, 101(8):245--247, 2008.
    [25]
    A. J. Rai, Z. Zhang, J. Rosenzweig, I. ming Shih, T. Pham, E. T. Fung, L. J. Sokoll, and D.W. Chan. Proteomic approaches to tumor marker discovery. Archives of Pathology & Laboratory Medicine, 126(12):1518--1526, 2002.
    [26]
    S. J. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Prentice Hall, 2nd edition, 2003.
    [27]
    I. M. Thompson, D. K. Pauler, P. J. Goodman, C. M. Tangen, et al. Prevalence of prostate cancer among men with a prostate-specific antigen level <=4.0 ng per milliliter. New England Journal of Medicine, 350(22):2239--2246, 2004.
    [28]
    V. Vapnik. Statistical Learning Theory. Wiley, New York, 1998.
    [29]
    S. Wagner. Heuristic Optimization Software Systems -- Modeling of Heuristic Optimization Algorithms in the HeuristicLab Software Environment. PhD thesis, Johannes Kepler University Linz, 2009.
    [30]
    S. Wagner and M. Affenzeller. SexualGA: Gender-specific selection for genetic algorithms. In N. Callaos, W. Lesso, and E. Hansen, editors, Proceedings of the 9th World Multi-Conference on Systemics, Cybernetics and Informatics (WMSCI) 2005, volume 4, pages 76--81. International Institute of Informatics and Systemics, 2005.
    [31]
    P. W. Williams and H. D. Gray. Gray's anatomy. New York: C. Livingstone, 37th edition, 1989.
    [32]
    S. Winkler. Evolutionary System Identification - Modern Concepts and Practical Applications. PhD thesis, Institute for Formal Models and Verification, Johannes Kepler University Linz, 2008.
    [33]
    B. W. Yin, A. Dnistrian, and K. O. Lloyd. Ovarian cancer antigen ca125 is encoded by the muc16 mucin gene. International Journal of Cancer, 98(5):737--40, 2002.
    [34]
    K. Yonemori, M. Ando, T. S. Taro, N. Katsumata, K. Matsumoto, Y. Yamanaka, T. Kouno, C. Shimizu, and Y. Fujiwara. Tumor-marker analysis and verification of prognostic models in patients with cancer of unknown primary, receiving platinum-based combination chemotherapy. Journal of Cancer Research and Clinical Oncology, 132(10):635--642, 2006.
    [35]
    L. Zhong, X. Zhou, K. Wei, X. Yang, C. Ma, C. Zhang, and Z. Zhang. Application of serum tumor markers and support vector machine in the diagnosis of oral squamous cell carcinoma. Shanghai Kou Qiang Yi Xue (Shanghai Journal of Stomatology), 17(5):457--460, 2008.

    Cited By

    View all
    • (2015)TMDFMProceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2015.7359763(657-660)Online publication date: 9-Nov-2015
    • (2014)Data based prediction of cancer diagnoses using heterogeneous model ensemblesProceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation10.1145/2598394.2609853(1337-1344)Online publication date: 12-Jul-2014
    • (2014)On the Identification of Virtual Tumor Markers and Tumor Diagnosis Predictors Using Evolutionary AlgorithmsAdvanced Methods and Applications in Computational Intelligence10.1007/978-3-319-01436-4_6(95-122)Online publication date: 2014
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GECCO '10: Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
    July 2010
    1496 pages
    ISBN:9781450300735
    DOI:10.1145/1830761
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 July 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. classification
    2. data mining
    3. machine learning
    4. statistical analysis
    5. tumor marker data

    Qualifiers

    • Short-paper

    Conference

    GECCO '10
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)TMDFMProceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2015.7359763(657-660)Online publication date: 9-Nov-2015
    • (2014)Data based prediction of cancer diagnoses using heterogeneous model ensemblesProceedings of the Companion Publication of the 2014 Annual Conference on Genetic and Evolutionary Computation10.1145/2598394.2609853(1337-1344)Online publication date: 12-Jul-2014
    • (2014)On the Identification of Virtual Tumor Markers and Tumor Diagnosis Predictors Using Evolutionary AlgorithmsAdvanced Methods and Applications in Computational Intelligence10.1007/978-3-319-01436-4_6(95-122)Online publication date: 2014
    • (2013)Variable Interaction Networks in Medical DataInternational Journal of Privacy and Health Information Management10.4018/ijphim.20130701011:2(1-16)Online publication date: 1-Jul-2013
    • (2013)Evolutionary identification of cancer predictors using clustered dataProceedings of the 15th annual conference companion on Genetic and evolutionary computation10.1145/2464576.2466809(1463-1470)Online publication date: 6-Jul-2013
    • (2013)An Integrated Clustering and Classification Approach for the Analysis of Tumor Patient DataRevised Selected Papers of the 14th International Conference on Computer Aided Systems Theory - EUROCAST 2013 - Volume 811110.1007/978-3-642-53856-8_49(388-395)Online publication date: 10-Feb-2013
    • (2012)Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced DataIEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics10.1109/TSMCB.2011.216714442:2(406-421)Online publication date: 1-Apr-2012
    • (2011)Identification of cancer diagnosis estimation models using evolutionary algorithmsProceedings of the 13th annual conference companion on Genetic and evolutionary computation10.1145/2001858.2002040(503-510)Online publication date: 12-Jul-2011
    • (2011)Analysis of selected evolutionary algorithms in feature selection and parameter optimization for data based tumor marker modelingProceedings of the 13th international conference on Computer Aided Systems Theory - Volume Part I10.1007/978-3-642-27549-4_43(335-342)Online publication date: 6-Feb-2011

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media