added min distance algorithm, used to identify the connected components (dedup)
added more mapping tests, using xslt picked from services.openaire