Identification disambiguation in databases

Walsh, John P.; Tang, Li

Various examples are provided for identification disambiguation in databases. In one example, among others, a system includes an approximate structural equivalence (ASE) analyzer. The ASE analyzer can obtain a set of records from a database; determine a knowledge homogeneity score(KHS)for a pair of records in the set of records; and determine a condition of ASE for the pair of records based upon the KHS and a predefined KHS threshold. In another example, a method can determine a plurality of references shared by at least two records in a set of records; and a weighting value for each shared reference. A KHS may then be determined for each pair of records in the set of records based upon a reference shared by the pair of records and the weighting value corresponding to the shared reference.