Evolutionary Computation for Feature Selection and Feature Construction
Description
In data mining/big data and machine learning, many real-world problems such as bio-data classification and biomarker detection, image analysis, text mining often involve a large number of features/attributes. However, not all the features are essential since many of them are redundant or even irrelevant, and the useful features are typically not equally important. Using all the features for classification or other data mining tasks typically does not produce good results due to the big dimensionality and the large search space. This problem can be solved by feature selection to select a small subset of original (relevant) features or feature construction to create a smaller set of high-level features using the original low-level features.
Feature selection and construction are very challenging tasks due to the large search space and feature interaction problems. Exhaustive search for the best feature subset of a given dataset is practically impossible in most situations. A variety of heuristic search techniques have been applied to feature selection and construction, but most of the existing methods still suffer from stagnation in local optima and/or high computational cost. Due to the global search potential and heuristic guidelines, evolutionary computation techniques such as genetic algorithms, genetic programming, particle swarm optimisation, ant colony optimisation, differential evolution and evolutionary multi-objective optimisation have been recently used for feature selection and construction for dimensionality reduction, and achieved great success. Many of these methods only select/construct a small number of important features, produce higher accuracy, and generated small models that are easy to understand/interpret and efficient to run on unseen data. Evolutionary computation techniques have now become an important means for handling big dimensionality issues where feature selection and construction are required. Furthermore, feature selection and dimensionality reduction has also been a main approach to explainable machine learning and interpretable AI.
The tutorial will introduce the general framework within which evolutionary feature selection and construction can be studied and applied, sketching a schematic taxonomy of the field and providing examples of successful real-world applications. The application areas to be covered will include bio-data classification and biomarker detection, image analysis and pattern classification, symbolic regression, network security and intrusion detection, and text mining. EC techniques to be covered will include genetic algorithms, genetic programming, particle swarm optimisation, differential evolution, ant colony optimisation, artificial bee colony optimisation, and evolutionary multi-objective optimisation. We will show how such evolutionary computation techniques (with a focus on particle swarm optimisation and genetic programming) can be effectively applied to feature selection/construction and dimensionality reduction and provide promising results.
Organizers
Bing Xue is a Fellow of IEEE, Fellow of Engineering New Zealand, currently Professor of Artificial Intelligence, Deputy Head of School in the School of Engineering and Computer Science at VUW, Deputy Director of Centre for Data Science and Artificial Intelligence at VUW. Her research focuses mainly on evolutionary computation, machine learning, big data, feature selection/learning, evolving neural networks, explainable AI and their real-world applications. Bing has over 400 papers published in fully refereed international journals and conferences including many highly cited papers and top most popular papers.
Bing is Chair of IEEE CIS Task Force on Evolutionary Deep Learning and Applications, vice-chair of IEEE CIS Task Force on Transfer Learning and Transfer Optimisation. Bing has also been served as an Associate/Guest Editor or Editorial Board Member for > 10 international journals, including IEEE TEVC, ACM TELO, IEEE TETCI, and IEEE TAI. She is a key organiser for many international conferences, e.g. General Chair of PRICAI 2025, Conference Chair of EuroGP 2025, IEEE CEC 2024, EuroGP 2024, Co-ambassador for Women in Data Science NZ 2025, 2024, and 2023, Panel Chair and Conflict-of-Interest Chair for IEEE CEC 2023, Tutorial Chair for IEEE WCCI 2022, Publication Chair of EuroGP 2022, Track Chair for ACM GECCO 2019-2022, Workshop Chair for IEEE ICDM 2021, Conference Activities Chair for IEEE SSCI 2021, Publicity Chair for IEEE CEC 2021, General Co-Chair of IVCNZ 2020, Program Co-Chair for KETO 2020, Senior PC of IJCAI 2019-2021, Finance Chair of IEEE CEC 2019, Program Chair of Austrasia AI 2018, IEEE CIS FASLIP Symposium founder and Chair since 2016, and others in international conferences.