You are viewing the site in preview mode

Skip to main content

Advertisement

Table 4 Criteria for selecting match keys for PIAC stage 3 (examples for 60 keys)

From: Empirical aspects of record linkage across multiple data sets using statistical linkage keys: the experience of the PIAC cohort study

Key no. Linkage key Joint. unique key rate (measure A) (a)Est. number of links Est. FMR (measure B) (b)Comparison key Marginal true: false (measure C) (c)Est. 'worst case' FMR
1 s3g2|dmYOB|s|pc 99.999 55631 0.00 701 >1000 0.04
2 s3g2|dmYOB|_|pc 99.957 56120 0.00 702 >1000 0.09
3 s3g2|dm_ob|s|pc 99.878 57047 0.01 703 >1000 0.82
4 s3g2|dmYOB|s|pc2 99.993 63788 0.01 704 >1000 0.55
5 s3_|dmYOB|s|pc 99.896 56819 0.01 705 925.9 0.48
6 s3g2|dm_ob|_|pc 99.878 57547 0.02 706 578.7 1.63
7 s3g2|dmYOB|_|pc2 99.934 64338 0.02 707 592.1 1.09
8 s3_|dmYOB|_|pc 99.896 57326 0.03 708 466.2 0.95
9 s3g2|dmYOB|s|st 99.981 67206 0.04 709 317.7 1.93
10 s3g2|dmYOB|_|st 99.897 67781 0.08 710 159.5 3.82
11 s3g2|__YOB|s|pc 99.715 58484 0.12 711 103.9 15.40
12 _g2|dmYOB|s|pc 99.797 56031 0.14 712 88.2 3.17
13 s3g2|dmYOB|s|_ 99.792 67743 0.17 713 80.7 5.74
14 s3g2|__YOB|_|pc 99.613 59012 0.23 714 51.9 30.52
15 _g2|dmYOB|_|pc 99.707 56541 0.27 715 44.0 6.28
16 s3g2|dmYOB|_|_ 99.650 68327 0.29 716 44.9 10.23
17 s3g2|dm_ob|s|pc2 99.647 65447 0.34 717 36.9 10.16
18 s3_|dm_ob|s|pc 99.478 58319 0.41 718 28.9 8.84
19 s3_|dmYOB|s|pc2 99.583 65185 0.43 719 29.5 5.90
20 s3g2|dm_ob|_|pc2 99.496 66024 0.67 720 18.1 20.14
601 s3g2|dmYOB|s|pc 100.000 44977 0.00 . . . . 0.00
602 s3g2|dmYOB|_|pc 99.998 45392 0.00 601 >1000 0.00
603 s3g2|dm_ob|s|pc 99.998 46105 0.00 601 >1000 0.01
604 s3g2|dmYOB|s|pc2 100.000 51170 0.00 601 >1000 0.00
605 s3_|dmYOB|s|pc 99.992 45855 0.00 601 >1000 0.00
606 s3g2|dm_ob|_|pc 99.998 46529 0.00 603 >1000 0.01
607 s3g2|dmYOB|_|pc2 99.998 51629 0.00 604 >1000 0.01
608 s3_|dmYOB|_|pc 99.992 46276 0.00 602 >1000 0.01
609 s3g2|dmYOB|s|st 100.000 53592 0.00 604 >1000 0.02
610 s3g2|dmYOB|_|st 99.998 54071 0.00 609 >1000 0.03
611 s3g2|__YOB|s|pc 99.976 47166 0.00 601 >1000 0.12
612 _g2|dmYOB|s|pc 99.978 45258 0.00 601 >1000 0.02
613 s3g2|dmYOB|s|_ 100.000 53901 0.00 609 >1000 0.04
614 s3g2|__YOB|_|pc 99.962 47607 0.00 602 >1000 0.23
615 _g2|dmYOB|_|pc 99.976 45678 0.00 612 >1000 0.05
616 s3g2|dmYOB|_|_ 99.998 54382 0.00 613 >1000 0.09
617 s3g2|dm_ob|s|pc2 99.994 52466 0.00 604 >1000 0.08
618 s3_|dm_ob|s|pc 99.986 47016 0.00 606 776.8 0.07
619 s3_|dmYOB|s|pc2 99.968 52178 0.00 604 >1000 0.05
620 s3g2|dm_ob|_|pc2 99.992 52936 0.00 617 772.5 0.16
701 s3g2|dmYOB|s|pc 100.000 49060 0.00 . . . . 0.00
702 s3g2|dmYOB|_|pc 99.984 49502 0.00 701 >1000 0.00
703 s3g2|dm_ob|s|pc 99.957 50305 0.00 701 >1000 0.04
704 s3g2|dmYOB|s|pc2 99.996 55840 0.00 701 >1000 0.03
705 s3_|dmYOB|s|pc 99.952 50034 0.00 701 >1000 0.02
706 s3g2|dm_ob|_|pc 99.957 50757 0.00 703 >1000 0.07
707 s3g2|dmYOB|_|pc2 99.977 56333 0.00 704 >1000 0.05
708 s3_|dmYOB|_|pc 99.952 50486 0.00 702 >1000 0.04
709 s3g2|dmYOB|s|st 99.989 58515 0.00 704 >1000 0.09
710 s3g2|dmYOB|_|st 99.965 59029 0.00 709 >1000 0.18
711 s3g2|__YOB|s|pc 99.847 51479 0.00 701 >1000 0.70
712 _g2|dmYOB|s|pc 99.869 49369 0.00 701 152.5 0.14
713 s3g2|dmYOB|s|_ 99.944 58836 0.01 709 144.2 0.26
714 s3g2|__YOB|_|pc 99.792 51951 0.01 702 679.1 1.39
715 _g2|dmYOB|_|pc 99.822 49816 0.01 712 220.5 0.29
716 s3g2|dmYOB|_|_ 99.892 59352 0.01 713 174.0 0.52
717 s3g2|dm_ob|s|pc2 99.855 57266 0.01 704 251.2 0.46
718 s3_|dm_ob|s|pc 99.715 51314 0.01 706 91.6 0.40
719 s3_|dmYOB|s|pc2 99.803 56960 0.01 704 156.5 0.27
720 s3g2|dm_ob|_|pc2 99.796 57771 0.02 717 85.5 0.92
  1. (a) Estimated number of links was derived from simple deterministic matching on the key (retaining only one occurrence of duplicates).
  2. (b) Comparative linkage key is one which is slightly more detailed and includes all the match key elements of the current key. There is not a strict hierarchy for the linkage keys, so in some cases there may be more than one appropriate key for the comparison.
  3. (c) 'Worst case' FMR is estimated assuming that the number of categories within a key element is equal to that implied by the most common category (s3: 72, g2: 11, dmob: 182, yob: 19, s: 2, st: 3, pc2: 11, pc: 156, aged care assessment date: 161, assessment team identifier: 25).
  4. Note: See note to Table 3 for definition of keys; '600' series include assessment date; '700' series linkage keys include assessment team identifier. Table only includes keys that were expected to have fewer than four times as many people with non-unique match keys as SLK-581. This equates to key 20 if client region is the only additional match data, key 64 if aged care assessment date and region are included and key 46 if assessment team identifier and region are included. Keys in bold italicsare those identified as not selected for use. Table showing all tested keys is available from the authors on request.