Can Wandora Extract This Information from Reddit?

Forum is for miscellaneous user help requests.

Can Wandora Extract This Information from Reddit?

Postby tbk2015 » Thu Jun 25, 2015 1:21 am

I would like to use Wandora to extract the following information:

1. A list of threads that contain a certain keyword (and the dates it was posted).
2. The text from each of those threads, including the entire comment tree.

Basically, I would like to get a list of the search results that contain the keyword, and the entire series of comments and messages associated with that search result.

Is this possible with Wandora?

Thanks for your help!
tbk2015
 
Posts: 1
Joined: Thu Jun 25, 2015 1:19 am

Re: Can Wandora Extract This Information from Reddit?

Postby akivela » Fri Jun 26, 2015 9:48 am

Hi Tbk2015

This is how it SHOULD happen:

  • Downalod the most recent Wandora release and start the application.
  • Start Reddit extractor by selecting menu option File > Extract > Social > Reddit extractor....
  • Select tab Submission Search if it isn't active.
  • Write your keyword to the text field beside Search button and mouse click the button.
  • Wait while Wandora fetches search results to the select list below the text field.
  • Select submissions in the select list. Use SHIFT and CTRL keys to select multiple submissions.
  • Check Crawl Comment Tree option in crawling options.
  • Click Extract button. Wandora starts extraction.
  • When ready, search for a reddit and you should find topics representing the submission and comments. Submission topic is associated with it's comments. Actual comment texts locate in comment topics as occurrences.

HOWEVER, it looks like Wandora fails to extract several submissions at once. Even if I select multiple submissions, Wandora extracts only first. This is clearly an unfinished feature. I have to put it into my working list. I'll write a notification post to this thread once the feature works as intended.

Kind Regards,
Aki / Wandora Team
akivela
Site Admin
 
Posts: 256
Joined: Tue Sep 18, 2007 10:20 am
Location: Helsinki, Finland

Re: Can Wandora Extract This Information from Reddit?

Postby akivela » Wed Jul 15, 2015 6:52 pm

Hi Tbk2015

For your information, I have pushed Reddit extractor fixes and enhancements to out master repository:

https://github.com/wandora-team/wandora/commit/99b23e368d096c92c3333478089c94a22b288cf0

If you are in a hurry, please download Wandora's source code and compile the application with Netbeans IDE. If you are not in a hurry, these fixes and enhancements will be on next version of the Wandora application released sometime in August. I haven't decided the exact date yet.

Kind Regards,
Aki / Wandora Team
akivela
Site Admin
 
Posts: 256
Joined: Tue Sep 18, 2007 10:20 am
Location: Helsinki, Finland

Re: Can Wandora Extract This Information from Reddit?

Postby akivela » Sat Aug 08, 2015 10:23 am

For your information, new Wandora versio (2015-08-06) was released few days ago. The release fixes and enhances the Reddit extractor as described in my earlier post.

Kind Regards,
Aki / Wandora Team
akivela
Site Admin
 
Posts: 256
Joined: Tue Sep 18, 2007 10:20 am
Location: Helsinki, Finland

Re: Can Wandora Extract This Information from Reddit?

Postby akivela » Sat Aug 08, 2015 10:28 am

For your information, new Wandora versio (2015-08-06) was released few days ago. The release fixes and enhances the Reddit extractor as described in my earlier post.
Kind Regards,
Aki / Wandora Team
akivela
Site Admin
 
Posts: 256
Joined: Tue Sep 18, 2007 10:20 am
Location: Helsinki, Finland


Return to How to... and problems

Who is online

Users browsing this forum: No registered users and 1 guest