Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution

Kirill S. Lukyanov; Лукьянов Кирилл С.; Pavel Andreevich Yaskov; Яськов Павел Андреевич; Andrey Igorevich Perminov; Перминов Андрей Игоревич; A. P. Kovalenko; Коваленко А. П.; Denis Yur'evich Turdakov; Турдаков Денис Юрьевич

doi:10.4213/rm10208

Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution

Autores: Lukyanov K.S.¹^,2^,3, Yaskov P.A.⁴^,5, Perminov A.I.¹^,3, Kovalenko A.P.⁶, Turdakov D.Y.¹^,3
Afiliações:
1. Ivannikov Institute for System Programming of the RAS
2. Moscow Institute of Physics and Technology (National Research University)
3. Research Center of the Trusted Artificial Intelligence ISP RAS
4. Steklov Mathematical Institute of Russian Academy of Sciences
5. National University of Science and Technology «MISIS»
6. Academy of Cryptography of Russian Federation
Edição: Volume 79, Nº 6 (2024)
Páginas: 57-82
Seção: Articles
URL: https://journal-vniispk.ru/0042-1316/article/view/281941
DOI: https://doi.org/10.4213/rm10208
ID: 281941

Citar

Texto integral

Acesso aberto
Acesso é fechado

Acesso está concedido
Acesso é fechado

Somente assinantes

Resumo
Sobre autores
Bibliografia
Arquivos suplementares
Estatísticas

Resumo

This work introduces a method aimed at enhancing the reliability of the Bayesian classifier. The method involves augmenting the training dataset, which consists of a mixture of distributions from two original classes, with artificially generated observations from a third, ‘background’ class, uniformly distributed over a compact set that contains the unknown support of the original mixture.This modification allows the value of the discriminant function outside the support of the training data distribution to approach a prescribed level (in this case, zero). Adding a decision option for ‘Refusal to Classify’, triggered when the discriminant function takes sufficiently small values, results in a localized increase in classifier reliability. Specifically, this approach addresses several issues: it enables the rejection of data that differs significantly from the training data; facilitates the detection of anomalies in input data; and avoids decision-making in ‘boundary’ regions when separating classes.The paper provides a theoretical justification for the optimality of the proposed classifier. The practical utility of the method is demonstrated through classification tasks involving images and time series.Additionally, a methodology for identifying trusted regions is proposed. This methodology can be used to detect anomalous data, cases of parameter shifts in class distributions, and areas of overlap between the distributions of the original classes. Based on these trusted regions, quantitative metrics for classifier reliability and efficiency are introduced.Bibliography: 23 titles.

Palavras-chave

OOD, machine learning, Bayesian classifier, trusted machine learning, interpretability, out-of-distribution (OOD), image classification, time series classification, rejection of classification, background class

Bibliografia

A. Jishan, R. C. Green II, “Cost aware LSTM model for predicting hard disk drive failures based on extremely imbalanced S.M.A.R.T. sensors data”, Eng. Appl. Artif. Intell., 127 (2024), 107339, 11 pp.
A. Caron, C. Hicks, V. Mavroudis, A view on out-of-distribution identification from a statistical testing theory perspective, 2024, 8 pp.
Peng Cui, Jinjia Wang, “Out-of-distribution (OOD) detection based on deep learning: a review”, Electronics, 11:21 (2022), 3500, 19 pp.
L. Devroye, L. Györfi, G. Lugosi, A probabilistic theory of pattern recognition, Appl. Math. (N. Y.), 31, Reprint of the 1996 original, Springer-Verlag, New York, 2013, xvi+636 pp.
S. M. Djurasevic, U. M. Pesovic, B. S. Djordjevic, “Anomaly detection model for predicting hard disk drive failures”, Appl. Artif. Intell., 35:8 (2021), 549–566
A. Farago, G. Lugosi, “Strong universal consistency of neural network classifiers”, IEEE Trans. Inform. Theory, 39:4 (1993), 1146–1151
D. Hendrycks, K. Gimpel, A baseline for detecting misclassified and out-of-distribution examples in neural networks, 2016 (v1 – 2016), 12 pp.
J. Jithish, B. Alangot, N. Mahalingam, Kiat Seng Yeo, “Distributed anomaly detection in smart grids: a federated learning-based approach”, IEEE Access, 11 (2023), 7157–7179
A. Klein, Backblaze: Hard drive data and stats,
Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, B. Cottereau, Wei Tsang Ooi, “Robodepth: Robust out-of-distribution depth estimation under corruptions”, Adv. Neural Inf. Process. Syst., 36 (2023), 1–45
Bo Li, Peng Qi, Bo Liu, Shuai Di, Jingen Liu, Jiquan Pei, Jinfeng Yi, Bowen Zhou, “Trustworthy AI: from principles to practices”, ACM Comput. Surveys, 55:9 (2023), 177, 46 pp.
Jeremiah Zhe Liu, S. Padhy, Jie Ren, Zi Lin, Yeming Wen, G. Jerfel, Z. Nado, J. Snoek, D. Tran, B. Lakshminarayanan, “A simple approach to improve single-model deep uncertainty via distance-awareness”, J. Mach. Learn. Res., 24 (2023), 42, 63 pp.
A. B. Nassif, M. Abu Talib, Q. Nasir, F. M. Dakalbab, “Machine learning for anomaly detection: a systematic review”, IEEE Access, 9 (2021), 78658–78700
M. Perello-Nieto, T. D. M. E. S. Filho, M. Kull, P. Flach, “Background check: a general technique to build more reliable and versatile classifiers”, 2016 IEEE 16th international conference on data mining (ICDM), IEEE, 2016, 1143–1148
R. Pinciroli, L. Yang, J. Alter, E. Smirni, “Lifespan and failures of SSDs and HDDs: similarities, differences, and prediction models”, IEEE Trans. Depend. Secure Comput., 20:1 (2023), 256–272
K. Rasheed, A. Qayyum, M. Ghaly, A. Al-Fuqaha, A. Razi, J. Qadir, “Explainable, trustworthy, and ethical machine learning for healthcare: a survey”, Comput. Biol. Med., 149 (2022), 106043, 23 pp.
Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, R. Dutta, R. Schaeffer, Sang T. Truong, Simran Arora, M. Mazeika, D. Hendrycks, Zinan Lin, Yu Cheng, S. Koyejo, Dawn Song, Bo Li, DecodingTrust: a comprehensive assessment of trustworthiness in GPT models, 2024 (v1 – 2023), 110 pp.
Qibo Yang, Xiaodong Jia, Xiang Li, Jianshe Feng, Wenzhe Li, Jay Lee, “Evaluating feature selection and anomaly detection methods of hard drive failure prediction”, IEEE Trans. Reliab., 70:2 (2021), 749–760
Hang Yu, Weixu Liu, Jie Lu, Yimin Wen, Xiangfeng Luo, Guangquan Zhang, “Detecting group concept drift from multiple data streams”, Pattern Recognition, 134 (2023), 109113, 11 pp.
He Zhang, Bang Wu, Xingliang Yuan, Shirui Pan, Hanghang Tong, Jian Pei, “Trustworthy graph neural networks: aspects, methods, and trends”, Proc. IEEE, 112:2 (2024), 97–139
Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, P. Moghadam, Mingyi He, C. Walder, Kaihao Zhang, M. Harandi, N. Barnes, Dense uncertainty estimation, 2021, 15 pp.
Mingyu Zhang, Wenqiang Ge, Ruichun Tang, Peishun Liu, “Hard disk failure prediction based on blending ensemble learning”, Appl. Sci., 13:5 (2023), 3288, 22 pp.
Zhilin Zhao, Statistical methods for out-of-distribution detection, PhD thesis, Univ. Technology Sydney, 2023, 107 pp.

Arquivos suplementares

Ação

1. JATS XML

Baixar

Nome de usuário
Senha
Lembrar usuário

Esqueceu a senha?	Cadastro

Nome de usuário
Senha
Lembrar usuário

Esqueceu a senha?	Cadastro

Extrapolation of the Bayesian classifier with an unknown support of the two-class mixture distribution

Texto integral

Resumo

Palavras-chave

Sobre autores

Kirill Lukyanov

Pavel Yaskov

Andrey Perminov

A. Kovalenko

Denis Turdakov

Bibliografia

Arquivos suplementares