Data from National Statistical Institutes is generally considered an important source of credible evidence for a variety of users. Summarization and dissemination via traditional methods is a convenient approach for providing this evidence. However, this is usually comprehensible only for users with a considerable level of statistical literacy. A promising alternative lies in augmenting the summarization linguistically. Less statistically literate users (e.g., domain experts and the general public), as well as disabled people can benefit from such a summarization. This article studies the potential of summaries expressed in short quantified sentences. Summaries including, for example, “most visits from remote countries are of a short duration” can be immediately understood by diverse users. Linguistic summaries are not intended to replace existing dissemination approaches, but can augment them by providing alternatives for the benefit of diverse users of official statistics. Linguistic summarization can be achieved via mathematical formalization of linguistic terms and relative quantifiers by fuzzy sets. To avoid summaries based on outliers or data with low coverage, a quality criterion is applied. The concept based on linguistic summaries is demonstrated on test interfaces, interpreting summaries from real municipal statistical data. The article identifies a number of further research opportunities, and demonstrates ways to explore those.
Almeida, R.J., M-J. Lesot, B. Bouchon-Meunier, U. Kaymak, and G. Moyse. 2013. “Linguistic Summaries of Categorical Time Series Septic Shock Patient Data.” In Proceedings of the 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2013), July 7–10, 2013. 1–8. Hyderabad.
Bavdaž, M. (editor). 2011. Final Report Integrating Findings on Business Perspectives Related to NSIs Statistics. Brussels: European Commission. (Deliverable 3.2 from FP7 project BLUE-Enterprise and Trade Statistics). Blue-Ets Project: SSH-CT-2010-244767.
Bier, V. and P. Nymand-Andersen. 2011. “Communicating Statistics to Frequent Users – One Size Fits All?” In Proceedings of the Committee for the Coordination of Statistical Activities (CCSA Special Session), September 8, 2011. Luxembourg.
Duraj, A., P.S. Szczepaniak, and J. Ochelska-Mierzejewska. 2015. “Detection of Outlier Information Using Linguistic Summarization.” In Proceedings of the 11th International Conference Flexible Query Answering Systems (FQAS 2015), October 26–28, 2015. 101–113. Cracow.
Galindo, J., A. Urrutia, and M. Piattini. 2006. Fuzzy Databases––Modeling. Design and Implementation. Hershey: Idea Group Publishing.
George, R. and R. Srikanth. 1996. “Data Summarization Using Genetic Algorithms and Fuzzy Logic.” In Genetic Algorithms and Soft Computing, edited by F. Herrera and J.L. Verdegay, 599–611. Heidelberg: Physica–Verlag.
Glöckner, I. 2006. Fuzzy Quantifiers – A Computational Theory. Berlin Heidelberg: Springer-Verlag.
Goebel, R., A. Chander, K. Holzinger, F. Lecue, Z. Akata, S. Stumpf, P. Kieseberg, and A. Holzinger. 2018. “Explainable AI: The New 42?” In Machine Learning and Knowledge Extraction, Springer Lecture Notes in Computer Science LNCS 11015, edited by A. Holzinger, P. Kieseberg, A. Tjoa, and E. Weippl, 295–303. Cham: Springer.
Heimgärtner, R., A. Holzinger, and R. Adams. 2008. “From Cultural to Individual Adaptive End-User Interfaces: Helping People with Special Needs.” In Proceedings of the 11th International Conference on Computers Helping People with Special Needs (ICCHP 2008), July 9–11, 2008. 82–89. Linz.
Holzinger, A. 2002. “User-Centered Interface Design for Disabled and Elderly People: First Experiences with Designing a Patient Communication System (PACOSY).” In Proceedings of the 8th International Conference on Computer Helping People with Special Needs (ICCHP 2002), July 15–20, 2002. 33–40. Linz.
Holzinger, A., B. Malle, P. Kieseberg, P.M. Roth, H. Müller, R. Reihs, and K. Zatloukal. 2017. “Machine Learning and Knowledge Extraction in Digital Pathology needs an integrative approach.” In Towards Integrative Machine Learning and Knowledge Extraction, edited by A. Holzinger, R. Goebel, M. Ferri, and V. Palade, 13–50. Cham: Springer.
Hudec, M. 2016. Fuzziness in Information Systems – How to Deal with Crisp and Fuzzy Data in Selection, Classification, and Summarization. Cham: Springer.
Hudec, M. 2017. “Merging Validity and Coverage for Measuring Quality of Data Summaries.” In Information Technology and Computational Physics, edited by P. Kulczycki, L.T. Kóczy, R. Mesiar, and J. Kacprzyk, 71–85. Cham: Springer.
Hudec, M. and D. Praženka. 2016. “Collecting and Managing Fuzzy Data in Statistical Relational Databases.” Statistical Journal of the IAOS 32: 245 – 255. Doi: http://dx.doi.org/10.3233/SJI-160956.
Hudec, M. and V. Torres Van Grinsven. 2013. “Business’ Participants Motivation in Official Surveys by Fuzzy Logic.” In Proceedings of the 1st Eurasian Multidisciplinary Forum (EMF 2013), October 24–26, 2013. 42–52. Tbilisi.
Kacprzyk, J. and P. Strykowski. 1999. “Linguistic Data Summaries for Intelligent Decision Support.” In Proceedings of the fourth European Workshop on Fuzzy Decision Analysis and Recognition Technology for Management, Planning and Optimization (EFDAN 1999), June 14–15, 1999. 3–12. Dortmund.
Kacprzyk, J., A. Wilbik, and S. Zadroz˙ny. 2006. “Linguistic Summarization of Trends: A Fuzzy Logic Based Approach.” In Proceedings of the 11th Information Processing and Management of Uncertainty in Knowledge Based Systems (IPMU 2006), July 2–7, 2006. 2166–2172. Paris.
Kacprzyk, J. and S. Zadroz˙ny. 1995. “FQUERY for Access: Fuzzy Querying for Windows-Based DBMS.” In Fuzziness in Database Management Systems, edited by P. Bosc and J. Kacprzyk, 415–433. Heidelberg: Physica-Verlag.
Kacprzyk, J. and S. Zadroz˙ny. 2005. “Linguistic Database Summaries and Their Protoforms: Towards Natural Language Based Knowledge Discovery Tools.” Information Sciences 173: 281–304. Doi: http://dx.doi.org/10.1016/j.ins.2005.03.002.
Klement, E.P., R. Mesiar, and E. Pap. 2005. “Triangular Norms: Basic Notions and Properties.” In Logical, Algebraic, Analytic, and Probabilistic Aspects of triangular Norms, edited by E.P. Klement and R. Mesiar, 17–60. Amsterdam: Elsevier.
Liu, B. 2011. “Uncertain Logic for Modeling Human Language.” Journal of Uncertain Systems 5: 3–20. Available at: www.jus.org.uk (accessed September 2012).
Meyer, A. and H.J. Zimmermann. 2011. “Applications of Fuzzy Technology in Business Intelligence.” International Journal of Computers, Communications & Control VI(3): 428–441. Doi: http://dx.doi.org/10.15837/ijccc.2011.3.2128.
Moyse, G., M-J. Lesot, and B. Bouchon-Meunier. 2013. “Mathematical Morphology Tools to Evaluate Periodic Linguistic Summaries.” In Flexible Query Answering Systems, edited by H.L. Larsen, 257–268. Berlin Heidelberg: Springer-Verlag.
Niewiadomski, A. 2002. “Appliance of Fuzzy Relations for Text Documents Comparing.” In Proceedings of the 6th Conference on Neural Networks and Soft Computing (ICNNSC’ 2002), June 11–15, 2002. Zakopane.
Smits, G., O. Pivert, and T. Girault. 2013. “ReqFlex: Fuzzy Queries for Everyone.” In Proceedings of the 39th International Conference on Very Large Data Bases, 26–30 August, Trento.
Torres van Grinsven, V. and G. Snijkers. 2015. “Sentiments and Perceptions of Business Respondents on Social Media: An Exploratory Analysis.” Journal of Official Statistics 31: 283–304. Doi: http://dx.doi.org/10.1515/jos-2015-0018.
Wu, D., J.M. Mendel, and J. Joo. 2010. “Linguistic Summarization Using If-Then Rules.” In Proceedings of the 2010 IEEE International Conference on Fuzzy Systems, July 18–23, 2010. 1–8. Barcelona.
Yager, R.R., M. Ford, and A.J. Canas. 1990. “An Approach to the Linguistic Summarization of Data.” In Proceedings of the 3rd International Conference of Information Processing and Management of Uncertainty in Knowledge-based Systems (IPMU 1990), July 2–6, 1990. 456–468. Paris.
Zadeh, L.A. 2001. “From Computing With Numbers to Computing With Words––From Manipulation of Measurements to Manipulation of Perceptions.” In Computing with Words, edited by P. Wang, 35–68. New York: Wiley.