## Binary Distance Calculator

- This tool currently computes 16 distance measures from two binary data sets of same size.
- A dynamic counter helps users check the size of the sets.
- Nonbinary sets must be first converted to binary (e.g., Yes=1 and No=0).
- Example: If A={Yes,No,Yes,Yes,No} and B={Yes,Yes,No,Yes,Yes}, then A=10110 and B=11011.
- This tool complements our Binary Similarity Calculator.

- Binary distance metrics are computed.
- Before making similarity-distance transformations, you may want to read our companion tutorial which discusses this topic (Garcia, 2015).
- The tool is based on the following sources: Choi, et al. (2010), IBM Knowledge Center (2011), Stata (2007), Hayek (1994), Hayes (1978), and Tulloss (1997).
- Please let us know how we can improve, enhance, or correct this tool.

- Data miners, teachers, or anyone that need to compare or grade data sets.

- Choi, S., Cha, S., and Tappert, C. C. (2010). A Survey of Binary Similarity and Distance Measures. Systemics, Cybernetics and Informatics, Vol. 8, 1, 43-48.
- Garcia, E (2015). A Tutorial on Distance and Similarity.
- Hayek, L. C. (1994). Analysis of Amphibian Biodiversity Data. Chapter 9. In: Measuring and monitoring miological diversity. Standard methods for amphibians. W. R. Heyer et al., eds. Smithsonian Institution, Washington, D. C.
- Hayes, W. B. (1978). Some Sampling Properties of the Fager Index for Recurrent Species Groups. Vol. 59, No. 1.
- IBM Knowledge Center (2011). Distances Similarity Measures for Binary Data. See also Sokal and Sneath Similarity Measure 3.
- Stata (2007). Stata Manuals.
- Tulloss, R. E. (1997). Assessment of Similarity Indices for Undesirable Properties and a new Tripartite Similarity Index Based on Cost Functions. Offprint from Palm, M. E. and I. H. Chapela, eds. 1997. Mycology in Sustainable Development: Expanding Concepts, Vanishing Borders. pp 122-143.

#### Feedback

Contact us for any suggestion or question regarding this tool.