MINTY: Rule-based Models that Minimize the Need for Imputing Features with Missing Values

May 3, 2024

Lena Stempfle

Fredrik D. Johansson



Figure by Lena Stempfle

Abstract

Rule models are often preferred in prediction tasks with tabular inputs as they can be easily interpreted using natural language and provide predictive performance on par with more complex models. However, most rule models’ predictions are undefined or ambiguous when some inputs are missing, forcing users to rely on statistical imputation models or heuristics like zero imputation, which undermines the models’ interpretability. In this work, we propose fitting concise yet precise rule models that learn to avoid relying on features with missing values and therefore limit their reliance on imputation at test time. We develop MINTY, a method that learns rules in the form of disjunctions between variables that act as replacements for each other when one or more is missing. This results in a sparse linear rule model, regularized to have minimal dependence on features with missing values, allowing a trade-off between goodness of fit, interpretability, and robustness to missing values at test time. We demonstrate the value of MINTY in experiments using synthetic and real-world datasets, showing that its predictive performance is comparable or favorable to baselines while reducing reliance on missing features.
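To make the idea of disjunctions whose variables "act as replacements for each other" concrete, the following minimal sketch evaluates a single disjunctive rule over binary features with missing entries. It is an illustration only and not the authors' implementation: the function name `eval_disjunction`, the use of `np.nan` to mark missing values, and the three-valued (true / false / undetermined) convention are all assumptions made for this example. A rule is true as soon as any observed literal is true, false only when every literal is observed and false, and undetermined otherwise; the share of undetermined evaluations gives a rough sense of how often imputation would still be needed.

```python
import numpy as np

def eval_disjunction(X, cols):
    """Three-valued OR over the binary columns `cols` of X.

    Hypothetical helper for illustration. np.nan marks a missing value.
    Returns 1.0 (true), 0.0 (false), or np.nan (undetermined) per row.
    """
    Z = X[:, cols]
    # nansum treats missing entries as 0, so this checks observed literals only
    any_true = np.nansum(Z, axis=1) > 0
    any_missing = np.isnan(Z).any(axis=1)
    # True if some observed literal is true; undetermined if none is true
    # but at least one is missing; false if all are observed and false
    return np.where(any_true, 1.0, np.where(any_missing, np.nan, 0.0))

# Toy data: two binary features, np.nan denotes a missing value
X = np.array([
    [1.0, np.nan],     # first literal true, so the rule is true despite the gap
    [np.nan, 0.0],     # no observed literal is true and one is missing
    [0.0, 0.0],        # fully observed and false
    [np.nan, 1.0],     # second literal stands in for the missing first one
])

rule = eval_disjunction(X, [0, 1])
# Fraction of rows where the rule value would still require imputation
reliance = np.isnan(rule).mean()
print(rule, reliance)
```

In this toy example only the second row leaves the rule undetermined. A regularizer that penalizes such cases, as the abstract describes at a high level, would favor rules whose variables rarely go missing together.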

Type

Conference paper

Publication

In Proceedings of AISTATS 2024

Last updated on May 3, 2024

Authors

Lena Stempfle

PhD Candidate in Machine Learning
