Article

Medvik - BMČ

Oblique and rotation double random forest

MA. Ganaie, M. Tanveer, PN. Suganthan, V. Snasel

Author: Ganaie, M A. Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, India. Electronic address: phd1901141006@iiti.ac.in
Author: Tanveer, M. Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, India. Electronic address: mtanveer@iiti.ac.in
Author: Suganthan, P N. School of Electrical & Electronic Engineering, Nanyang Technological University, Singapore; KINDI Center for Computing Research, College of Engineering, Qatar University, Qatar. Electronic address: epnsugan@ntu.edu.sg
Author: Snasel, V. Department of Computer Science, VŠB - Technical University of Ostrava, Czech Republic. Electronic address: vaclav.snasel@vsb.cz

Neural networks. 2022 ; 153 (-) : 496-517. [pub] 20220618

Source: Neural Netw
ISSN 1879-2782

Language: English. Country: United States

Document type: journal article

Persistent link: https://www.medvik.cz/link/bmc22024652

PubMed 35816861
DOI 10.1016/j.neunet.2022.06.012

Random forest is an ensemble of decision trees based on the bagging and random subspace concepts. As suggested by Breiman, the core strength of an ensemble model lies in the strength of its unstable learners and the diversity among them. In this paper, we propose two approaches, known as oblique and rotation double random forests. In the first approach, we propose the rotation-based double random forest, in which a transformation (rotation) of the feature space is generated at each node. Because a different random feature subspace is chosen for evaluation at each node, the transformation at each node is different. Different transformations result in greater diversity among the base learners and hence better generalization performance. With the double random forest as the base learner, the data at each node is transformed via two different transformations, namely principal component analysis and linear discriminant analysis. In the second approach, we propose the oblique double random forest. Decision trees in random forest and double random forest are univariate, which results in axis-parallel splits that fail to capture the geometric structure of the data. Also, the standard random forest may not grow sufficiently large decision trees, resulting in suboptimal performance. To capture the geometric properties and to grow decision trees of sufficient depth, we propose the oblique double random forest. The oblique double random forest models are multivariate decision trees. At each non-leaf node, a multisurface proximal support vector machine generates the optimal plane for better generalization performance. Also, different regularization techniques (Tikhonov regularization, axis-parallel split regularization, null space regularization) are employed to tackle the small sample size problem in the decision trees of the oblique double random forest. The proposed ensembles produce larger trees than the standard ensembles of decision trees because bagging is used at each non-leaf node, which results in improved performance. The baseline models and the proposed oblique and rotation double random forest models are evaluated on 121 benchmark UCI datasets and real-world fisheries datasets. Both the statistical analysis and the experimental results demonstrate the efficacy of the proposed oblique and rotation double random forest models compared to the baseline models on the benchmark datasets.
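
The node-level rotation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`gini`, `rotated_node_split`, `goes_left`), the parameter `n_sub`, the use of Gini impurity, and the exhaustive threshold search are all assumptions made for illustration. The sketch draws a bootstrap sample at the node, picks a random feature subspace, rotates that subspace with principal component analysis, and then searches for the best axis-parallel threshold in the rotated space. The linear discriminant analysis variant and the oblique (multisurface proximal support vector machine) variant are not shown.

```python
import numpy as np


def gini(y):
    """Gini impurity of a label vector."""
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)


def rotated_node_split(X, y, n_sub, rng):
    """Find one split at a node (illustrative sketch, not the paper's code).

    Steps: node-level bootstrap, random feature subspace, PCA rotation of
    that subspace, then an axis-parallel threshold search in rotated space.
    Returns (feats, mean, w, threshold, impurity) or None if no split exists.
    """
    n, d = X.shape
    feats = rng.choice(d, size=n_sub, replace=False)  # random subspace
    boot = rng.integers(0, n, size=n)                 # bootstrap at this node
    Xb, yb = X[boot][:, feats], y[boot]

    # PCA via SVD of the centered bootstrap sample; rows of Vt are the
    # principal directions, so R rotates the subspace onto them.
    mean = Xb.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xb - mean, full_matrices=False)
    R = Vt.T
    Zb = (Xb - mean) @ R

    best = None
    for j in range(Zb.shape[1]):
        vals = np.unique(Zb[:, j])
        # Candidate thresholds: midpoints between consecutive distinct values.
        for t in (vals[:-1] + vals[1:]) / 2.0:
            left = Zb[:, j] <= t
            n_left = left.sum()
            score = (n_left * gini(yb[left])
                     + (len(yb) - n_left) * gini(yb[~left])) / len(yb)
            if best is None or score < best[0]:
                best = (score, j, t)

    if best is None:  # all bootstrap points identical in the subspace
        return None
    score, j, t = best
    return feats, mean, R[:, j], t, score


def goes_left(X, split):
    """Route the node's original (non-bootstrapped) samples with the split."""
    feats, mean, w, t, _ = split
    return (X[:, feats] - mean) @ w <= t
```

In this sketch the split is chosen on the bootstrap sample but applied to the node's full data through `goes_left`, which is one way to read "bagging is used at each non-leaf node". Because the split direction `w` is a principal direction of a random subspace, the resulting boundary is generally not parallel to the original feature axes.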

Department of Computer Science VŠB Technical University of Ostrava Czech Republic

Department of Mathematics Indian Institute of Technology Indore Simrol Indore 453552 India

KINDI Center for Computing Research College of Engineering Qatar University Qatar

School of Electrical and Electronic Engineering Nanyang Technological University Singapore

000: 00000naa a2200000 a 4500

001: bmc22024652

003: CZ-PrNML

005: 20221031100755.0

007: ta

008: 221017s2022 xxu f 000 0|eng||

009: AR

024 7_: $a 10.1016/j.neunet.2022.06.012 $2 doi

035 __: $a (PubMed)35816861

040 __: $a ABA008 $b cze $d ABA008 $e AACR2

041 0_: $a eng

044 __: $a xxu

100 1_: $a Ganaie, M A $u Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, India. Electronic address: phd1901141006@iiti.ac.in

245 10: $a Oblique and rotation double random forest / $c MA. Ganaie, M. Tanveer, PN. Suganthan, V. Snasel

520 9_: $a Random forest is an ensemble of decision trees based on the bagging and random subspace concepts. As suggested by Breiman, the core strength of an ensemble model lies in the strength of its unstable learners and the diversity among them. In this paper, we propose two approaches, known as oblique and rotation double random forests. In the first approach, we propose the rotation-based double random forest, in which a transformation (rotation) of the feature space is generated at each node. Because a different random feature subspace is chosen for evaluation at each node, the transformation at each node is different. Different transformations result in greater diversity among the base learners and hence better generalization performance. With the double random forest as the base learner, the data at each node is transformed via two different transformations, namely principal component analysis and linear discriminant analysis. In the second approach, we propose the oblique double random forest. Decision trees in random forest and double random forest are univariate, which results in axis-parallel splits that fail to capture the geometric structure of the data. Also, the standard random forest may not grow sufficiently large decision trees, resulting in suboptimal performance. To capture the geometric properties and to grow decision trees of sufficient depth, we propose the oblique double random forest. The oblique double random forest models are multivariate decision trees. At each non-leaf node, a multisurface proximal support vector machine generates the optimal plane for better generalization performance. Also, different regularization techniques (Tikhonov regularization, axis-parallel split regularization, null space regularization) are employed to tackle the small sample size problem in the decision trees of the oblique double random forest. The proposed ensembles produce larger trees than the standard ensembles of decision trees because bagging is used at each non-leaf node, which results in improved performance. The baseline models and the proposed oblique and rotation double random forest models are evaluated on 121 benchmark UCI datasets and real-world fisheries datasets. Both the statistical analysis and the experimental results demonstrate the efficacy of the proposed oblique and rotation double random forest models compared to the baseline models on the benchmark datasets.

650 12: $a algorithms $7 D000465

650 _2: $a principal component analysis $7 D025341

650 _2: $a rotation $7 D012399

650 12: $a support vector machine $7 D060388

655 _2: $a journal article $7 D016428

700 1_: $a Tanveer, M $u Department of Mathematics, Indian Institute of Technology Indore, Simrol, Indore, 453552, India. Electronic address: mtanveer@iiti.ac.in

700 1_: $a Suganthan, P N $u School of Electrical & Electronic Engineering, Nanyang Technological University, Singapore; KINDI Center for Computing Research, College of Engineering, Qatar University, Qatar. Electronic address: epnsugan@ntu.edu.sg

700 1_: $a Snasel, V $u Department of Computer Science, VŠB - Technical University of Ostrava, Czech Republic. Electronic address: vaclav.snasel@vsb.cz

773 0_: $w MED00011811 $t Neural networks $x 1879-2782 $g Vol. 153, No. - (2022), pp. 496-517

856 41: $u https://pubmed.ncbi.nlm.nih.gov/35816861 $y Pubmed

910 __: $a ABA008 $b sig $c sign $y p $z 0

990 __: $a 20221017 $b ABA008

991 __: $a 20221031100752 $b ABA008

999 __: $a ok $b bmc $g 1854405 $s 1175942

BAS __: $a 3

BAS __: $a PreBMC

BMC __: $a 2022 $b 153 $c - $d 496-517 $e 20220618 $i 1879-2782 $m Neural networks $n Neural Netw $x MED00011811

LZP __: $a Pubmed-20221017
