Hi all,
As a newcomer to the field, I have some feedback on the document "WDC Product Data Corpus and Gold Standard for Large-Scale Product Matching - Version 2.0".
Feedback:
1. I found the pair selection algorithm too complicated to follow from a natural-language description alone. Could you provide the code or pseudocode for this algorithm?
2. The first time I read the document, I was a bit confused by technical terms such as "offer", "cluster", and "gold standard". I think it would be easier for newcomers if there were a list of definitions for these terms.
Thanks.
Tatsuhiko