Automated selection of particle-jet features for data analysis in High Energy Physics experiments
We show that it is possible to reduce the size of a classification problem by automatically ranking the relative importance of available features.
Variables are importance-sorted with a decision tree algorithm and correlated ones are removed after ranking. The selected features can be used as input quantities for the classification problem at hand.
We have tested the method with the case of highly boosted di-jet resonances decaying to two $b$-quarks, to be selected against an overwhelming QCD background with a Deep Neural network. We make it explicit the relation between different importance rankings obtained with different algorithms. We also show how the signal-to-background ratio changes, varying the number of features to feed the Neural Network with.
How to cite
Metadata are provided both in "article" format (very similar to INSPIRE) as this helps creating
very compact bibliographies which can be beneficial to authors and
readers, and in "proceeding" format
which is more detailed and complete.