Hello there, I am a junior researcher working with big scholarly data.
I'm trying to load the data into Neo4j.
I followed these steps:
-1 Merge the data (authors and papers) from MAG and Aminer.
-2 Store all the data as entity CSV files, ready to be loaded into Neo4j with the neo4j-import tool.
Because of the string-length limit in the neo4j-import command, an exception was thrown.
I went back to check the data in the original JSON file and in my CSV file,
and found an author with an extremely long name.
I then filtered the data for authors whose name is abnormally long,
and found that the following authors have wrong names (id, name length):
(2883534986,53789)
(2809509190,231359)
(2151762742,249334)
(2130138085,1291987)
(1903406832,1758860)
(2619296525,2496188)
(244247712,3728819)
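
For reference, the filter I mean can be sketched roughly as below. This is only a minimal sketch: it assumes the author records are stored one JSON object per line with `id` and `name` fields, and the function name `find_long_names` and the 1000-character threshold are hypothetical choices for illustration, not something taken from MAG or Aminer.

```python
import json

# Assumed threshold: any name longer than this is treated as abnormal.
MAX_NAME_LENGTH = 1000


def find_long_names(lines, max_length=MAX_NAME_LENGTH):
    """Return (id, name length) pairs for records whose name exceeds max_length.

    `lines` is any iterable of strings, each holding one JSON object
    (assumed layout: {"id": ..., "name": ...}).
    """
    abnormal = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        name = record.get("name", "")
        if len(name) > max_length:
            abnormal.append((record.get("id"), len(name)))
    # Sort by name length, shortest first.
    return sorted(abnormal, key=lambda pair: pair[1])
```

It can be run over an open file object, since iterating a file yields one line at a time.
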
All of the abnormal author names look like this:
"name": "HocomAdvies #x F\t\t1\t0\t2016-08-23\r\n2492590803\t18993\tjinho kim\tJinho Kim\t188068037\t4\t31\t2016-08-23\r\n2492590804\t20879\tantonio carlos de andrade s..."
Each one is a combination of many names, together with ids and dates.
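
To show what I mean, here is a minimal sketch that splits such a name on the `\r\n` and `\t` characters it contains. The helper name `split_embedded_rows` is hypothetical, and the sample string is just the beginning of the value shown above (the truncated part is left out). I am not assuming anything about what the individual columns mean.

```python
def split_embedded_rows(name):
    """Split a name on CRLF line breaks, then split each piece on tabs."""
    return [row.split("\t") for row in name.split("\r\n") if row]


# First two complete pieces of the abnormal name, after JSON decoding
# (so \t and \r\n are real tab and line-break characters).
sample = (
    "HocomAdvies #x F\t\t1\t0\t2016-08-23\r\n"
    "2492590803\t18993\tjinho kim\tJinho Kim\t188068037\t4\t31\t2016-08-23"
)

rows = split_embedded_rows(sample)
for row in rows:
    print(len(row), row)
```

The second piece splits into a full tab-separated row of its own, which is why the value looks like many records stuck together rather than a single name.
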
Is something wrong with the data? Or do these entries represent a group of authors?