Hi All,
Im doing my master thesis at the CLTL (Piek Vossen’s group) and im currently trying to implement the UKB Similarity on a set of sentences.
The aim is to run these scripts on my dataset of which you can below see an example
Cost of cverage under obamacare to increase in 2015, Americans with health insurance bought under the affordable are
the price will rice 5 percent, About a quarter of counties with one or two insurers saw an increase in rates of more than 10 percent
Increases in insurance costs, there are notable increases
Premiums rising faster than eight years before, Health insurance premiums have risen more
My aim is to check all the sentences in the left column with all the sentences in the right column and get a score per pair checked on similarity.
So I have some specific questions on this matter:
- How can I run the
similarity.pl script on this set sentence columns
- Is there a specified format for the input data?
- I am using the UKBSim in the Mac OS terminal/command line; what steps do i need to take to run the script on the dataset in the terminal with the data set (in the right format)
Thanks in advance,
Kind regards,
Yassine