Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Current »

Using Memorized Classification Decisions

Starting with version 3.5.3 of OpenEMPI, there is a new feature that allows the system to memorize the classification decisions a user makes in classifying probable links from the Review Queue so, that if an update operation would cause a record pair to be placed back in the Review Queue, the system will apply the memorized, manual classification decision instead. Enabling the feature is just a matter of adding a parameter in the mpi-config.xml file. The name of the property is "remember-manual-classifications" under the admin-configuration section and it should be set to true.

...
    <admin-configuration>
        <session-duration>3600</session-duration>
        <file-repository-directory>fileRepository</file-repository-directory>
        <autostart-pixpdq>true</autostart-pixpdq>
        <data-directory>/sysnet/data</data-directory>
        <session-duration>1800</session-duration>
        <remember-manual-classifications>true</remember-manual-classifications>
...

The following example demonstrates in detail how this feature affects the matching decisions made by the system.

In this OpenEMPI instance, we have enabled this feature and the configuration of the probabilistic algorithm has been set up to classify pairs of records where there is disagreement on the postal code value to classify them as probable links. The record pair for the person by the name "Sean Richmond" is present in the Review Queue since the two records disagree in the value of the Postal Code field. Since it is clear that the two records refer to the same person, we classify this record pair as a match by clicking on the "Link" button on the bottom of the screen.

 

 

 

 
  • No labels