Really simple 'trick' for feature selection: before running a feature importance analysis, temporarily inject a handful of random noise features into one's dataset. Any real feature that ranks BELOW the most important of these noise features is, by extension, also essentially noise and can be dropped. Indeed, on small datasets it can be shocking how spuriously important these noise features can appear! This technique is adapted from the 2003 paper "Ranking a Random Feature for Variable and Feature Selection" (https://lnkd.in/dNRC8s44). A minimal code sketch of the idea follows at the end of this post. ---------------------------------------------------------- For more on classical ML, get my book "The Orange Book of Machine Learning - Green edition" via https://lnkd.in/dZVnK67t #datascience #machinelearning
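Here is a minimal sketch of the trick, not the paper's exact procedure: it assumes a pandas DataFrame X of features and labels y, and uses scikit-learn's impurity-based random-forest importances as the ranking. The helper name drop_below_noise and the n_noise parameter are illustrative choices, not from the original post.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

def drop_below_noise(X, y, n_noise=5, random_state=0):
    """Keep only features that outrank every injected random noise feature.

    X: pandas DataFrame of candidate features; y: target labels.
    Uses a random forest's impurity-based importances as the ranking.
    """
    rng = np.random.default_rng(random_state)

    # Temporarily inject pure-noise columns into a copy of the dataset.
    X_aug = X.copy()
    noise_cols = [f"noise_{i}" for i in range(n_noise)]
    for col in noise_cols:
        X_aug[col] = rng.normal(size=len(X_aug))

    # Fit the model and rank all columns (real + noise) by importance.
    model = RandomForestClassifier(n_estimators=500, random_state=random_state)
    model.fit(X_aug, y)
    importances = pd.Series(model.feature_importances_, index=X_aug.columns)

    # The best-ranked noise feature sets the cutoff: anything at or
    # below it is indistinguishable from noise and gets dropped.
    noise_ceiling = importances[noise_cols].max()
    return [c for c in X.columns if importances[c] > noise_ceiling]

# Usage: kept = drop_below_noise(X, y); X_reduced = X[kept]
```

Any importance measure works in place of the random forest's here (permutation importance, gain-based importances from a boosted model, etc.); the noise columns simply calibrate how high a ranking can be achieved by chance alone.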