Invention Title:

PROBABILISTIC BLACK-BOX ANOMALY ATTRIBUTION

Publication number:

US20250335800

Publication date:
Section:

Physics

Class:

G06N7/01

Inventors:

Assignee:

Applicant:

Smart overview of the Invention

A novel approach is introduced for understanding black-box machine learning models through a process called probabilistic black-box anomaly attribution. This method leverages a framework known as "generative perturbation analysis" to identify and analyze anomalies within test data. The focus is on determining the responsibility of individual variables for these anomalies, providing insights into the factors influencing the model's outputs.

Background

Machine learning models have become increasingly prevalent, yet their opacity has sparked concerns, leading to a demand for explainable artificial intelligence (XAI). While initial XAI efforts focused on psychological aspects, current research emphasizes practical applications in business and industry. A key challenge is identifying the influence of each input when a model's prediction significantly differs from observed outcomes, known as anomaly attribution.

Methodology

The process involves generating a variable distribution by applying multiple perturbations to test data, which aids in understanding the model's behavior under various conditions. An attribution score is then calculated to quantify each variable's impact on an anomaly. This score provides a clear metric for assessing the significance of different variables, allowing for more informed decision-making.

Technical Details

The method includes using expected value estimation with an estimated local gradient and a sparsity constraint to balance model accuracy and complexity. Anomalies are identified by calculating negative natural logarithms of conditional probabilities, enhancing anomaly detection efficiency. Variational Bayesian inference is employed to approximate complex probability distributions, facilitating faster and more scalable evaluations.

Applications and Benefits

This approach offers a comprehensive understanding of black-box models, particularly useful in fields like real estate where property values are influenced by numerous factors. By determining the most significant variables affecting anomalies, analysts can make better recommendations. The system's ability to quantify uncertainty in attribution scores further enhances its applicability in industrial contexts.