Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Bojan Cestnik

Open Access

Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Bojan Cestnik

| Dec 31, 2019

International Journal of Applied Mathematics and Computer Science

Volume 29 (2019): Issue 4 (December 2019)

New Perspectives in Nonlinear and Intelligent Control (In Honor of Alexander P. Kurdyukov) (special section, pp. 629-712), Julio B. Clempner, Enso Ikonen, Alexander P. Kurdyukov (Eds.)

About this article

Cite

Page range: 783 - 796

Received: Dec 15, 2018

Accepted: Apr 23, 2019

DOI: https://doi.org/10.2478/amcs-2019-0058

Keywords
probability estimation, small samples, minimal error, -estimate

© 2019 Bojan Cestnik, published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Berger, J.O. (1985). Statistical Decision Theory and Bayesian Analysis, Springer, New York, NY.10.1007/978-1-4757-4286-2Search in Google Scholar

Bouguila, N. (2013). On the smoothing of multinomial estimates using Liouville mixture models and applications, Pattern Analysis and Applications16(3): 349–363.10.1007/s10044-011-0236-8Search in Google Scholar

Breiman, L., Friedman, J.H., Olshen, R.A. and Stone, C.J. (1984). Classification and Regression Trees, Wadsworth, Belmont.Search in Google Scholar

Calvo, B. and Santafé, G. (2016). SCMAMP: Statistical comparison of multiple algorithms in multiple problems, The R Journal8(1): 248–256.10.32614/RJ-2016-017Search in Google Scholar

Cestnik, B. (1990). Estimating probabilities: A crucial task in machine learning, Proceedings of the 9th European Conference on Artificial Intelligence, London, UK, pp. 147–149.Search in Google Scholar

Cestnik, B. (2018). Experimental framework in R for experimenting with probability estimations from small samples, https://github.com/BojanCestnik/probability-estimation.R.Search in Google Scholar

Cestnik, B. and Bratko, I. (1991). On estimating probabilities in tree pruning, Proceedings of the European Working Session on Learning, Porto, Portugal, pp. 138–150.10.1007/BFb0017010Search in Google Scholar

Chan, J.C.C. and Kroese, D.P. (2011). Rare-event probability estimation with conditional Monte Carlo, Annals of Operations Research189(1): 43–61.10.1007/s10479-009-0539-ySearch in Google Scholar

Chandra, B. and Gupta, M. (2011). Robust approach for estimating probabilities in naïve-Bayes classifier for gene expression data, Expert Systems with Applications38(3): 1293–1298.10.1016/j.eswa.2010.06.076Search in Google Scholar

DasGupta, A. (2011). Probability for Statistics and Machine Learning: Fundamentals and Advanced Topics, Springer, New York, NY.10.1007/978-1-4419-9634-3Search in Google Scholar

DeGroot, M. and Schervish, M. (2012). Probability and Statistics, Addison-Wesley, Boston, MA.Search in Google Scholar

Demšar, J. (2006). Statistical comparisons of classifiers over multiple data sets, Journal of Machine Learning Research7(1): 1–30.Search in Google Scholar

Domingos, P. and Pazzani, M. (1997). On the optimality of the simple Bayesian classifier under zero-one loss, Machine Learning29(2): 103–130.10.1023/A:1007413511361Search in Google Scholar

Džeroski, S., Cestnik, B. and Petrovski, I. (1993). Using the m-estimate in rule induction, Journal of Computing and Information Technology1(1): 37–46.Search in Google Scholar

Feller, W. (1968). An Introduction to Probability Theory and Its Applications, Willey, Hoboken, NJ.Search in Google Scholar

Fienberg, S.E. and Holland, P.W. (1972). On the choice of flattening constants for estimating multinomial probabilities, Journal of Multivariate Analysis2(1): 127–134.10.1016/0047-259X(72)90014-0Search in Google Scholar

Flach, P. (2012). Machine Learning: The Art and Science of Algorithms that Make Sense of Data, Cambridge University Press, New York, NY.10.1017/CBO9780511973000Search in Google Scholar

Fürnkranz, J. and Flach, P.A. (2005). ROC ‘n’ rule learning—towards a better understanding of covering algorithms, Machine Learning58(1): 39–77.10.1007/s10994-005-5011-xSearch in Google Scholar

García, S., Fernández, A., Luengo, J. and Herrera, F. (2010). Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power, Information Sciences180(10): 2044–2064.10.1016/j.ins.2009.12.010Search in Google Scholar

García, S. and Herrera, F. (2008). An extension on statistical comparisons of classifiers over multiple data sets for all pairwise comparisons, Journal of Machine Learning Research9(12): 2677–2694.Search in Google Scholar

Good, I.J. (1965). The Estimation of Probabilities: An Essay on Modern Bayesian Methods, MIT Press, Cambridge, MA.Search in Google Scholar

Good, I.J. (1966). How to estimate probabilities, IMA Journal of Applied Mathematics2(4): 364–383.10.1093/imamat/2.4.364Search in Google Scholar

Good, P. and Hardin, J. (2012). Common Errors in Statistics (and How to Avoid Them), Wiley, Hoboken, NJ.10.1002/9781118360125Search in Google Scholar

Grover, J. (2012). Strategic Economic Decision-Making: Using Bayesian Belief Networks to Solve Complex Problems, Springer New York, NY.10.1007/978-1-4614-6040-4Search in Google Scholar

Gudder, S. (1988). Quantum Probability, Academic Press, Boston, MA.Search in Google Scholar

Laplace, P.-S. (1814). Essai philosophique sur les probabilités, Courcier, Paris.Search in Google Scholar

Larose, D. (2010). Discovering Statistics, W.H. Freeman, New York, NY.Search in Google Scholar

Mitchell, T.M. (1997). Machine Learning, McGrawHill, Maidenhead.Search in Google Scholar

Piegat, A. and Landowski, M. (2012). Optimal estimator of hypothesis probability for data mining problems with small samples, International Journal of Applied Mathematics and Computer Science22(3): 629–645, DOI: 10.2478/v10006-012-0048-z.10.2478/v10006-012-0048-zOpen DOI Search in Google Scholar

Piegat, A. and Landowski, M. (2013). Mean square error optimal completeness estimator eph2 of probability, Journal of Theoretical and Applied Computer Science7(3): 3–20.Search in Google Scholar

Piegat, A. and Landowski, M. (2014). Specialized, MSE-optimal m-estimators of the rule probability especially suitable for machine learning, Control and Cybernetics43(1): 133–160.Search in Google Scholar

R Core Team (2018). R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, https://www.R-project.org/.Search in Google Scholar

Rudas, T. (2008). Handbook of Probability: Theory and Applications, SAGE Publications, Thousand Oaks, CA.10.4135/9781452226620Search in Google Scholar

Starbird, M. (2006). What Are the Chances? Probability Made Clear, Chantilly, VA.Search in Google Scholar

Sulzmann, J.N. and Fürnkranz, J. (2009). An empirical comparison of probability estimation techniques for probabilistic rules, in J. Gama et al. (Eds), Discovery Science, Springer, Heidelberg, pp. 317–331.10.1007/978-3-642-04747-3_25Search in Google Scholar

Webb, J. (2007). Game Theory: Decisions, Interaction and Evolution, Springer, London.Search in Google Scholar

eISSN:: 2083-8492
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Mathematics, Applied Mathematics

Journal RSS Feed

Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Published Online: Dec 31, 2019

Page range: 783 - 796

Received: Dec 15, 2018

Accepted: Apr 23, 2019

DOI: https://doi.org/10.2478/amcs-2019-0058

Keywordsprobability estimation, small samples, minimal error, -estimate

© 2019 Bojan Cestnik, published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Keywords
probability estimation, small samples, minimal error, -estimate