Matches in SemOpenAlex for { <https://semopenalex.org/work/W2284209553> ?p ?o ?g. }
- W2284209553 abstract "DEcision-tree induction is one of the most employed methods to extract knowledge from data. There are several distinct strategies for inducing decision trees from data, each one presenting advantages and disadvantages according to its corresponding inductive bias. These strategies have been continuously improved by researchers over the last 40 years. This thesis, following recent breakthroughs in the automatic design of machine learning algorithms, proposes to automatically generate decision-tree induction algorithms. Our proposed approach, namely HEAD-DT, is based on the evolutionary algorithms paradigm, which improves solutions based on metaphors of biological processes. HEAD-DT works over several manually-designed decision-tree components and combines the most suitable components for the task at hand. It can operate according to two different frameworks: i) evolving algorithms tailored to one single data set (specific framework); and ii) evolving algorithms from multiple data sets (general framework). The specific framework aims at generating one decision-tree algorithm per data set, so the resulting algorithm does not need to generalise beyond its target data set. The general framework has a more ambitious goal, which is to generate a single decision-tree algorithm capable of being effectively applied to several data sets. The specific framework is tested over 20 UCI data sets, and results show that HEAD-DT’s specific algorithms outperform algorithms like CART and C4.5 with statistical significance. The general framework, in turn, is executed under two different scenarios: i) designing a domain-specific algorithm; and ii) designing a robust domain-free algorithm. The first scenario is tested over 35 microarray gene expression data sets, and results show that HEAD-DT’s algorithms consistently outperform C4.5 and CART in different experimental configurations. The second scenario is tested over 67 UCI data sets, and HEAD-DT’s algorithms were shown to be competitive with C4.5 and CART. Nevertheless, we show that HEAD-DT is prone to a special case of overfitting when it is executed under the second scenario of the general framework, and we point to possible alternatives for solving this problem. Finally, we perform an extensive experiment for evaluating the best single-objective fitness function for HEAD-DT, combining 5 classification performance measures with three aggregation schemes. We evaluate the 15 fitness functions in 67 UCI data sets, and the best of them are employed to generate algorithms tailored to balanced and imbalanced data. Results show that the automatically-designed algorithms outperform CART and C4.5 with statistical significance, indicating that HEAD-DT is also capable of generating custom algorithms for data with a particular kind of statistical profile." @default.
- W2284209553 created "2016-06-24" @default.
- W2284209553 creator A5039629929 @default.
- W2284209553 date "2015-11-19" @default.
- W2284209553 modified "2023-09-26" @default.
- W2284209553 title "On the automatic design of decision-tree induction algorithms" @default.
- W2284209553 cites W107445549 @default.
- W2284209553 cites W116375701 @default.
- W2284209553 cites W1231053678 @default.
- W2284209553 cites W132782689 @default.
- W2284209553 cites W148291325 @default.
- W2284209553 cites W1485092523 @default.
- W2284209553 cites W1487658218 @default.
- W2284209553 cites W1488832888 @default.
- W2284209553 cites W1494213040 @default.
- W2284209553 cites W1494771723 @default.
- W2284209553 cites W1497256448 @default.
- W2284209553 cites W1498522186 @default.
- W2284209553 cites W1501753568 @default.
- W2284209553 cites W1503303035 @default.
- W2284209553 cites W1507079426 @default.
- W2284209553 cites W1507234839 @default.
- W2284209553 cites W1510523207 @default.
- W2284209553 cites W1510671410 @default.
- W2284209553 cites W1512383952 @default.
- W2284209553 cites W1513386654 @default.
- W2284209553 cites W1513389696 @default.
- W2284209553 cites W1515620500 @default.
- W2284209553 cites W1519169075 @default.
- W2284209553 cites W1521475723 @default.
- W2284209553 cites W1522701493 @default.
- W2284209553 cites W1523600104 @default.
- W2284209553 cites W1524188521 @default.
- W2284209553 cites W1525844566 @default.
- W2284209553 cites W1527083469 @default.
- W2284209553 cites W1527480195 @default.
- W2284209553 cites W1534707631 @default.
- W2284209553 cites W153647241 @default.
- W2284209553 cites W1539097253 @default.
- W2284209553 cites W1542525373 @default.
- W2284209553 cites W1544969377 @default.
- W2284209553 cites W1546917456 @default.
- W2284209553 cites W1548531254 @default.
- W2284209553 cites W1548779692 @default.
- W2284209553 cites W1550451942 @default.
- W2284209553 cites W1553373771 @default.
- W2284209553 cites W1554944419 @default.
- W2284209553 cites W1559912706 @default.
- W2284209553 cites W1559950192 @default.
- W2284209553 cites W1559996711 @default.
- W2284209553 cites W1562117546 @default.
- W2284209553 cites W1563072111 @default.
- W2284209553 cites W1564780186 @default.
- W2284209553 cites W1564947197 @default.
- W2284209553 cites W1565338878 @default.
- W2284209553 cites W1565377632 @default.
- W2284209553 cites W1565746575 @default.
- W2284209553 cites W1567797675 @default.
- W2284209553 cites W1568834902 @default.
- W2284209553 cites W1570896421 @default.
- W2284209553 cites W1572978214 @default.
- W2284209553 cites W1576818901 @default.
- W2284209553 cites W1577506768 @default.
- W2284209553 cites W1580617490 @default.
- W2284209553 cites W1582082363 @default.
- W2284209553 cites W1588435133 @default.
- W2284209553 cites W1589187426 @default.
- W2284209553 cites W1593354505 @default.
- W2284209553 cites W1594031697 @default.
- W2284209553 cites W1602306524 @default.
- W2284209553 cites W1602363634 @default.
- W2284209553 cites W1602699467 @default.
- W2284209553 cites W1603456912 @default.
- W2284209553 cites W1604329830 @default.
- W2284209553 cites W1605906394 @default.
- W2284209553 cites W1639032689 @default.
- W2284209553 cites W1642657809 @default.
- W2284209553 cites W1745341198 @default.
- W2284209553 cites W1780185704 @default.
- W2284209553 cites W1783003885 @default.
- W2284209553 cites W1808644423 @default.
- W2284209553 cites W182248535 @default.
- W2284209553 cites W1825578079 @default.
- W2284209553 cites W1825843584 @default.
- W2284209553 cites W1839292360 @default.
- W2284209553 cites W185934748 @default.
- W2284209553 cites W1862552409 @default.
- W2284209553 cites W1871423930 @default.
- W2284209553 cites W1874063393 @default.
- W2284209553 cites W1878134432 @default.
- W2284209553 cites W1929858224 @default.
- W2284209553 cites W1935550669 @default.
- W2284209553 cites W1963838563 @default.
- W2284209553 cites W1965120561 @default.
- W2284209553 cites W1966253115 @default.
- W2284209553 cites W1970074386 @default.
- W2284209553 cites W1972953764 @default.
- W2284209553 cites W1976123439 @default.
- W2284209553 cites W1977997977 @default.
- W2284209553 cites W1978552298 @default.