Splits points into two sets: the n that the random forest is least certain about, and all others.
an RDD of unlabeled points in the form (id, feature vector, labeling context).
The number of points to select.
A RandomForest model with trained weights.
Two RDDs in the same format as input, one consisting of points close to the margin, the other consisting of the remaining points.