Various examples are provided for identification disambiguation in databases. In one example, among others, a system includes an approximate structural equivalence (ASE) analyzer. The ASE analyzer can obtain a set of records from a database; determine a knowledge homogeneity score(KHS)for a pair of records in the set of records; and determine a condition of ASE for the pair of records based upon the KHS and a predefined KHS threshold. In another example, a method can determine a plurality of references shared by at least two records in a set of records; and a weighting value for each shared reference. A KHS may then be determined for each pair of records in the set of records based upon a reference shared by the pair of records and the weighting value corresponding to the shared reference.
Identification disambiguation in databases
John P. Walsh;
2014-01-01
Abstract
Various examples are provided for identification disambiguation in databases. In one example, among others, a system includes an approximate structural equivalence (ASE) analyzer. The ASE analyzer can obtain a set of records from a database; determine a knowledge homogeneity score(KHS)for a pair of records in the set of records; and determine a condition of ASE for the pair of records based upon the KHS and a predefined KHS threshold. In another example, a method can determine a plurality of references shared by at least two records in a set of records; and a weighting value for each shared reference. A KHS may then be determined for each pair of records in the set of records based upon a reference shared by the pair of records and the weighting value corresponding to the shared reference.| File | Dimensione | Formato | |
|---|---|---|---|
|
Walsh&TangUS8799237.pdf
accesso aperto
:
Publisher’s version
Dimensione
1.83 MB
Formato
Adobe PDF
|
1.83 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


