Utility Driven Approach to Learn Credible Interpretations of Machine Learning Models
Implementing Organization
Indian Institute of Technology (IIT)
Principal Investigator
Dr. Koustav Rudra
Indian Institute of Technology (IIT) Kharagpur, West Bengal
About
Explainability is crucial for deep learning models, yet post-hoc explanation methods often focus on local interpretations and cannot explain a model's global behavior. This project aims to make interpretability part of model development, using it as a constraint or regularizer that controls the learning process. Previous studies have not explored the relationships among the words in a text, although these relationships help in understanding causality and the model's decision-making process. The project plans to develop interpretable graphical models based on an annotated dataset and to optimize the task and the explanation jointly. A multi-task learning approach could play a significant role in training such models, because it allows related tasks to be learned together and to benefit from one another. The main focus is to explore the nonlinear structure of the input and to jointly learn explanation tokens and the relations among them. Existing datasets contain explanation tokens but not the relations. To overcome the scarcity of good-quality explanation data, the project will explore learning strategies such as active learning and fidelity-weighted learning. The study also aims to discover explanation tokens and to apply a continuous learning setup that updates the model iteratively. Interpretability of machine learning models has multiple stakeholders, including system developers, who need ranked lists of training examples for bug detection and quick fixes of model errors, and end users, who seek the important features, decision paths, and text snippets or pixels responsible for a prediction. The project will design a utility function to measure the importance of training instances and a metric to quantify interpretability and human experience.
Patents
0
Source
Science and Engineering Research Board (SERB), DST 2022-23
Science and Engineering Research Board (SERB), New Delhi
Anusandhan National Research Foundation (ANRF)
Quick Information
Area of Research
Engineering Sciences
Start Year
2022
End Year
2024
Sanction Amount
₹ 30.90 lakh
Status
Completed
Contact
krudra5@gmail.com
Output
No. of Research Papers
00
Technologies (If Any)
00
No. of PhDs Produced
00
No. of Patents
Filed: 00
Granted: 00
Disclaimer:
Information available on this portal is sourced from various organizations and is provided for informational purposes only. Users are advised to verify details from the respective official sources.