Abstract:
While Kolmogorov (1965) complexity is the accepted absolute measure of information content in an individual finite object, a similarly absolute notion is needed for the i...Show MoreMetadata
Abstract:
While Kolmogorov (1965) complexity is the accepted absolute measure of information content in an individual finite object, a similarly absolute notion is needed for the information distance between two individual objects, for example, two pictures. We give several natural definitions of a universal information metric, based on length of shortest programs for either ordinary computations or reversible (dissipationless) computations. It turns out that these definitions are equivalent up to an additive logarithmic term. We show that the information distance is a universal cognitive similarity distance. We investigate the maximal correlation of the shortest programs involved, the maximal uncorrelation of programs (a generalization of the Slepian-Wolf theorem of classical information theory), and the density properties of the discrete metric spaces induced by the information distances. A related distance measures the amount of nonreversibility of a computation. Using the physical theory of reversible computation, we give an appropriate (universal, antisymmetric, and transitive) measure of the thermodynamic work required to transform one object in another object by the most efficient process. Information distance between individual objects is needed in pattern recognition where one wants to express effective notions of "pattern similarity" or "cognitive similarity" between individual objects and in thermodynamics of computation where one wants to analyze the energy dissipation of a computation from a particular input to a particular output.
Published in: IEEE Transactions on Information Theory ( Volume: 44, Issue: 4, July 1998)
DOI: 10.1109/18.681318
Citations are not available for this document.
Cites in Papers - |
Cites in Papers - IEEE (73)
Select All
1.
Maria J. P. Sousa, Armando J. Pinho, Diogo Pratas, "Asensitive Compression-Based Method for Filtering Targeted FASTQ Sequencing Reads", 2024 32nd European Signal Processing Conference (EUSIPCO), pp.1561-1565, 2024.
2.
Maria J. P. Sousa, Armando J. Pinho, Diogo Pratas, "Asensitive Compression-Based Method for Filtering Targeted FASTQ Sequencing Reads", 2024 32nd European Signal Processing Conference (EUSIPCO), pp.1561-1565, 2024.
3.
Yujie Zhang, Qi Yang, Yifei Zhou, Xiaozhong Xu, Le Yang, Yiling Xu, "TCDM: Transformational Complexity Based Distortion Metric for Perceptual Point Cloud Quality Assessment", IEEE Transactions on Visualization and Computer Graphics, vol.30, no.10, pp.6707-6724, 2024.
4.
Hepeng Dai, Chang-Ai Sun, Huai Liu, Xiangyu Zhang, "DFuzzer: Diversity-Driven Seed Queue Construction of Fuzzing for Deep Learning Models", IEEE Transactions on Reliability, vol.73, no.2, pp.1075-1089, 2024.
5.
Zohreh Aghababaeyan, Manel Abdellatif, Lionel Briand, Ramesh S, Mojtaba Bagherzadeh, "Black-Box Testing of Deep Neural Networks through Test Case Diversity", IEEE Transactions on Software Engineering, vol.49, no.5, pp.3182-3204, 2023.
6.
Su Gao, Luhai Fan, "Clustering Information Distance by Nested-Lattice Compression", IEEE Access, vol.7, pp.156228-156236, 2019.
7.
Mihai Coca, Andrei Anghel, Mihai Datcu, "Unbiased Seamless SAR Image Change Detection Based on Normalized Compression Distance", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.12, no.7, pp.2088-2096, 2019.
8.
Diogo Pratas, Armando J. Pinho, "Metagenomic composition analysis of sedimentary ancient DNA from the Isle of Wight", 2018 26th European Signal Processing Conference (EUSIPCO), pp.1177-1181, 2018.
9.
David Pyles, Cornelius Brinegar, Michael A. Saville, "Analysis of Kinematic Model Effects on SAR ECM", NAECON 2018 - IEEE National Aerospace and Electronics Conference, pp.309-317, 2018.
10.
Rituparna Sarkar, Scott T. Acton, "SDL: Saliency-Based Dictionary Learning Framework for Image Similarity", IEEE Transactions on Image Processing, vol.27, no.2, pp.749-763, 2018.
11.
Marion Revolle, François Cayre, Nicolas Le Bihan, "Clustering and causality inference using algorithmic complexity", 2017 25th European Signal Processing Conference (EUSIPCO), pp.843-847, 2017.
12.
Paul M. B. Vitányi, "Exact Expression For Information Distance", IEEE Transactions on Information Theory, vol.63, no.8, pp.4725-4728, 2017.
13.
Lav R. Varshney, Julius Kusuma, Vivek K Goyal, "On Palimpsests in Neural Memory: An Information Theory Viewpoint", IEEE Transactions on Molecular, Biological and Multi-Scale Communications, vol.2, no.2, pp.143-153, 2016.
14.
Rituparna Sarkar, Scott T. Acton, "Slide: Saliency guided image dictionary and image similarity evaluation", 2016 IEEE International Conference on Image Processing (ICIP), pp.216-220, 2016.
15.
Robert Feldt, Simon Poulding, David Clark, Shin Yoo, "Test Set Diameter: Quantifying the Diversity of Sets of Test Cases", 2016 IEEE International Conference on Software Testing, Verification and Validation (ICST), pp.223-233, 2016.
16.
Armando J. Pinho, Diogo Pratas, Paulo J. S. G. Ferreira, "Authorship Attribution Using Relative Compression", 2016 Data Compression Conference (DCC), pp.329-338, 2016.
17.
En-hui Yang, "On coding for data analytics: New information distances", 2016 Information Theory and Applications Workshop (ITA), pp.1-6, 2016.
18.
David Threm, Liguo Yu, S. Ramaswamy, S D Sudarsan, "Using normalized compression distance to measure the evolutionary stability of software systems", 2015 IEEE 26th International Symposium on Software Reliability Engineering (ISSRE), pp.112-120, 2015.
19.
Susana Brás, Armando J. Pinho, "ECG biometric identification: A compression based approach", 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp.5838-5841, 2015.
20.
J. M. Lillo-Castellano, I. Mora-Jiménez, R. Santiago-Mozos, F. Chavarría-Asso, A. Cano-González, A. García-Alberola, J. L. Rojo-Álvarez, "Symmetrical Compression Distance for Arrhythmia Discrimination in Cloud-Based Big-Data Services", IEEE Journal of Biomedical and Health Informatics, vol.19, no.4, pp.1253-1263, 2015.
21.
Andrew R. Cohen, Paul M.B. Vitányi, "Normalized Compression Distance of Multisets with Applications", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.37, no.8, pp.1602-1614, 2015.
22.
Patrick Köthur, Mike Sips, Henryk Dobslaw, Doris Dransch, "Visual Analytics for Comparison of Ocean Model Output with Reference Data: Detecting and Analyzing Geophysical Processes Using Clustering Ensembles", IEEE Transactions on Visualization and Computer Graphics, vol.20, no.12, pp.1893-1902, 2014.
23.
Michael Taynnan Barros, Sasitharan Balasubramaniam, Brendan Jennings, "Using Information Metrics and Molecular Communication to Detect Cellular Tissue Deformation", IEEE Transactions on NanoBioscience, vol.13, no.3, pp.278-288, 2014.
24.
Michael Taynnan Barros, Sasitharan Balasubramaniam, Brendan Jennings, Yevgeni Koucheryavy, "Adaptive transmission protocol for molecular communications in cellular tissues", 2014 IEEE International Conference on Communications (ICC), pp.3981-3986, 2014.
25.
Paulo J. S. G. Ferreira, Armando J. Pinho, "Compression-based normal similarity measures for DNA sequences", 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.419-423, 2014.
26.
Sara P. Garcia, Joao M. O. S. Rodrigues, Sergio Santos, Diogo Pratas, Vera Afreixo, Carlos Bastos, Paulo J. S. G. Ferreira, Armando J. Pinho, "A Genomic Distance for Assembly Comparison Based on Compressed Maximal Exact Matches", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.10, no.3, pp.793-798, 2013.
27.
Guorong Li, Qingming Huang, Lei Qin, Shuqiang Jiang, "SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently", IEEE Transactions on Circuits and Systems for Video Technology, vol.23, no.4, pp.695-709, 2013.
28.
Joel Ratsaby, Vadim Sirota, "FPGA-based data compressor based on prediction by partial matching", 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel, pp.1-5, 2012.
29.
Xianglilan Zhang, Hongnan Wang, Tony J. Collins, Zhigang Luo, Ming Li, "Cytoplasm image classification based on Kolmogorov complexity", 2012 5th International Conference on BioMedical Engineering and Informatics, pp.256-260, 2012.
30.
Andre Burkovski, Sebastian Klenk, Gunther Heidemann, "Similarity Calculation with Length Delimiting Dictionary Distance", 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence, pp.856-864, 2011.
Cites in Papers - Other Publishers (248)
1.
Markus Knoflacher, "Perplexing Cats and Demons: Pointers to the Quantum-Physical Foundations of Life", Relativity of Evolution, pp.25, 2024.
2.
Neelofar Neelofar, Aldeida Aleti, "Towards Reliable AI: Adequacy Metrics for Ensuring the Quality of System-Level Testing of Autonomous Vehicles", 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE), pp.816-827, 2024.
3.
Tasnim ALASALI, Omar DAKKAK, "EXPLORING THE LANDSCAPE OF SDN-BASED DDOS DEFENSE: A HOLISTIC EXAMINATION OF DETECTION AND MITIGATION APPROACHES, RESEARCH GAPS AND PROMISING AVENUES FOR FUTURE EXPLORATION", International Journal of Advanced Natural Sciences and Engineering Researches, vol.7, no.4, pp.327, 2023.
4.
Thomas Chambon, Jean-Loup Guillaume, Jeanne Lallement, "Information Complexity Ranking: A New Method of Ranking Images by Algorithmic Complexity", Entropy, vol.25, no.3, pp.439, 2023.
5.
Artemy Kolchinsky, "Generalized Zureks bound on the cost of an individual classical or quantum computation", Physical Review E, vol.108, no.3, 2023.
6.
"Diatom Morphological Complexity Over Time as a Measurable Dynamical System", Mathematical Macroevolution in Diatom Research, pp.355, 2023.
7.
Xavier Oriols, Hrvoje Nikolic, "Three types of Landauer?s erasure principle: a microscopic view", The European Physical Journal Plus, vol.138, no.3, 2023.
8.
Ilir Capuni, Jarkko Kari, Alexander Shen, "Taming randomness and complexity – Essays in honour of Professor Péter Gács", Theoretical Computer Science, vol.949, pp.113776, 2023.
9.
Rudi L. Cilibrasi, Paul M. B. Vitanyi, "Fast Phylogeny of SARS-CoV-2 by Compression", Entropy, vol.24, no.4, pp.439, 2022.
10.
Laurent Beaudoin, Loïca Avanthey, "How to help digital-native students to successfully take control of their learning: A return of 8 years of experience on a computer science e-learning platform in higher education", Education and Information Technologies, 2022.
11.
Nikolay Vereshchagin, "Information disclosure in the framework of Kolmogorov complexity", Theoretical Computer Science, 2022.
12.
Sahar Tahvili, Leo Hatvani, "Transformation, vectorization, and optimization", Artificial Intelligence Methods for Optimization of the Software Testing Process, pp.35, 2022.
13.
Andrei Romashchenko, "Clustering with Respect to the Information Distance", Theoretical Computer Science, 2022.
14.
A. O. Lopes, R. Ruggiero, "Nonequilibrium in Thermodynamic Formalism: The Second Law, Gases and Information Geometry", Qualitative Theory of Dynamical Systems, vol.21, no.1, 2022.
15.
Forrest Webler, Marilyne Andersen, "Measurement in the Age of Information", Information, vol.13, no.3, pp.111, 2022.
16.
Songsong Dai, "Second quantised information distance", IET Quantum Communication, 2022.
17.
Songsong Dai, "Quantum information distance based on classical descriptions", Quantum Machine Intelligence, vol.4, no.1, 2022.
18.
Tiasa Mondol, Daniel G. Brown, "Computational Creativity and Aesthetics with Algorithmic Information Theory", Entropy, vol.23, no.12, pp.1654, 2021.
19.
J. A. Tenreiro Machado, J. M. Rocha-Neves, Filipe Azevedo, J. P. Andrade, "Advances in the computational analysis of SARS-COV2 genome", Nonlinear Dynamics, 2021.
20.
Tyler Millhouse, "Really Real Patterns", Australasian Journal of Philosophy, pp.1, 2021.
21.
Ravi Kiran Raman, Lav R. Varshney, "Universal Clustering", Information-Theoretic Methods in Data Science, pp.263, 2021.
22.
Kenong Su, Tianwei Yu, Hao Wu, "Accurate feature selection improves single-cell RNA-seq cell clustering", Briefings in Bioinformatics, 2021.
23.
Asmeret Naugle, Stephen Verzi, Kiran Lakkaraju, Laura Swiler, Christina Warrender, Michael Bernard, Vicente Romero, "Feedback density and causal complexity of simulation model structure", Journal of Simulation, pp.1, 2021.
24.
Klaus Ambos-Spies, Wolfgang Merkle, Sebastiaan A. Terwijn, "Normalized Information Distance and the Oscillation Hierarchy", Journal of Computer and System Sciences, 2021.
25.
Mariana S. Ramos, João M. Carvalho, Armando J. Pinho, Susana Brás, "On the Impact of the Data Acquisition Protocol on ECG Biometric Identification", Sensors, vol.21, no.14, pp.4645, 2021.
26.
Helena Miton, Olivier Morin, "Graphic complexity in writing systems", Cognition, vol.214, pp.104771, 2021.
27.
Tahereh Rezaei, Shahabeddin M. Aslmarand, Robert Snyder, Behzad Khajavi, Paul M. Alsing, Michael Fanto, Doyeol Ahn, Warner A. Miller, "Experimental realization of Schumacher's information geometric Bell inequality", Physics Letters A, pp.127444, 2021.
28.
Milton Silva, Diogo Pratas, Armando J. Pinho, "AC2: An Efficient Protein Sequence Compression Tool Using Artificial Neural Networks and Cache-Hash Models", Entropy, vol.23, no.5, pp.530, 2021.
29.
Liguo Yu, "Using Kolmogorov Complexity to Study the Coevolution of Header Files and Source Files of C-alike Programs", Research Anthology on Recent Trends, Tools, and Implications of Computer Programming, pp.814, 2021.
30.
Tiasa Mondol, Daniel G. Brown, "Grammar-based compression and its use in symbolic music analysis", Journal of Mathematics and Music, pp.1, 2021.