package sampleclean.clean.deduplication

Type Members

  1. case class ActiveLearningStrategy(displayedColNames: List[String], featurizer: Featurizer) extends Product with Serializable

    Creates an active learning strategy that asynchronously runs an active learning algorithm; the results of that algorithm are ultimately used for deduplication.

  2. case class CrowdsourcingStrategy(displayedColNames: List[String], featurizer: Featurizer) extends Product with Serializable

    This class is used to request crowd participation.

  3. class EntityResolution extends SampleCleanAlgorithm

    This is the base class for attribute deduplication.

  4. class RecordDeduplication extends SampleCleanAlgorithm

    This is an abstract class for record deduplication.
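
As a rough illustration of the two strategy case classes listed above, the sketch below constructs each one from a list of column names and a featurizer. It is not the library's code: `Featurizer` is replaced by an empty stand-in trait, the two case classes are redeclared locally with the constructor signatures shown in the listing, and `DummyFeaturizer`, `StrategySketch` and the column names are hypothetical.

```scala
// Stand-in so the sketch compiles on its own. The real Featurizer belongs to
// the SampleClean library; no members are assumed here.
trait Featurizer

// Local copies that mirror the constructor signatures in the listing above.
case class ActiveLearningStrategy(displayedColNames: List[String], featurizer: Featurizer)
case class CrowdsourcingStrategy(displayedColNames: List[String], featurizer: Featurizer)

object StrategySketch {
  // Hypothetical featurizer instance, used only for illustration.
  object DummyFeaturizer extends Featurizer

  // Hypothetical column names.
  val cols: List[String] = List("name", "address")

  // Both strategies take the same two arguments.
  val active: ActiveLearningStrategy = ActiveLearningStrategy(cols, DummyFeaturizer)

  // Case classes provide copy, so a variant can be derived without mutation.
  val fewerCols: ActiveLearningStrategy = active.copy(displayedColNames = List("name"))

  // Reuse the fields of one strategy to build the other.
  val crowd: CrowdsourcingStrategy =
    CrowdsourcingStrategy(active.displayedColNames, active.featurizer)
}
```

Because both types are case classes, they get structural equality, `copy`, and pattern matching for free, which is why a strategy can be adjusted by copying rather than by mutating it.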

Value Members

  1. object EntityResolution

  2. object RecordDeduplication

  3. package blocker

  4. package join

  5. package matcher
