This method builds an Record Deduplication algorithm that will resolve automatically.
names of attributes that will be used for deduplication
threshold used in the algorithm. Must be between 0.0 and 1.0
If set to true, the algorithm will automatically calculate token weights. Default token weights are defined based on token idf values.
Adding weights into the join might lead to more reliable pair comparisons and speed up the algorithm if there is an abundance of common words in the dataset.