What a wonderful add on to devtools! Thank you!
I have set up the sitemap to scrape Tripadvisor reviews, but one thing is hanging me up. Do you mind looking at it and helping please?
I am trying to collect the user name, location, number of reviews, date, and the review its self. I can get the pagnation to change the page, but when I run it, only the very last page of reviews (147th) is saved.
{"startUrl":"https://www.tripadvisor.com/Hotel_Review-g297671-d302890-Reviews-The_Oberoi_Vanyavilas-Sawai_Madhopur_Sawai_Madhopur_District_Rajasthan.html","selectors":[{"parentSelectors":["full_review_element"],"type":"SelectorText","multiple":false,"id":"title","selector":"span.noQuotes","regex":"","delay":""},{"parentSelectors":["full_review_element"],"type":"SelectorText","multiple":false,"id":"review","selector":"p.partial_entry","regex":"","delay":""},{"parentSelectors":["full_review_element"],"type":"SelectorHTML","multiple":false,"id":"time","selector":"span.ratingDate","regex":"","delay":""},{"parentSelectors":["full_review"],"type":"SelectorElement","multiple":true,"id":"profile","selector":"div.prw_rup.prw_reviews_member_info_hsx","delay":""},{"parentSelectors":["full_review_element"],"type":"SelectorText","multiple":false,"id":"reviewer_name","selector":"div.username span.expand_inline","regex":"","delay":""},{"parentSelectors":["full_review_element"],"type":"SelectorText","multiple":false,"id":"location","selector":"div.location span.expand_inline","regex":"","delay":""},{"parentSelectors":["full_review_element"],"type":"SelectorText","multiple":false,"id":"number_of_review","selector":"span.badgetext","regex":"","delay":""},{"parentSelectors":["_root","pagnation"],"type":"SelectorElementClick","multiple":true,"id":"full_review_element","selector":"div.review","delay":"","clickElementSelector":"p.partial_entry span.taLnk","clickElementUniquenessType":"uniqueText","clickType":"clickMore","discardInitialElements":true},{"parentSelectors":["_root"],"type":"SelectorElementClick","multiple":true,"id":"pagnation","selector":"div.listContainer div.unified","clickElementSelector":"div.listContainer span.pageNum:nth-of-type(n+2)","clickElementUniquenessType":"uniqueText","clickType":"clickOnce","discardInitialElements":false,"delay":"4000"}],"_id":"ramadareviews"}
I've ran it with "Discard initial elements that are available before click" on and off. I've set the delay for the pagnation to 4 seconds, and the request to 2 seconds.
Thank you to anyone how helps!