New York Times Article Search API extractor

From WandoraWiki
Revision as of 14:17, 30 April 2012 by Akivela (Talk | contribs)

Jump to: navigation, search

Wandora's New York Times Article Search API extractor performs an api request to New York Times Article Search API and transforms the response JSON to topics and associations. After successful extraction Wandora contains information about New York Times articles and extracted information can be visualized, exported and modified in Wandora.

Wandora user has to sign up for a personal API key in order to make extractions using Wandora application. Wandora doesn't store your api-key between use sessions. If you find the extracted information useful and want to use it, read API Terms of Use carefully.

Example

In this example, Wandora user performs an article search to New York Times Article Search API with a query obama. First, user locates the extractor in File > Extract > News > New York Times Article Search API extractor and selects the menu option.


Nytimes example 01.gif


Wandora opens up an extractor dialog. Wandora user fills in a search query and selects fields to be extracted. Listed fields are a subset of fields described in New York Times Article Search API documentation. We experienced some difficulties in extracting too many fields: The number of result rows in response dropped down to one instead of default ten. Notice, you can always perform sequential extractions with a different set of fields. As Wandora merges article topics automatically, latter results merge nicely with earlier results. In this example user selects to extract fields author, body, byline, column_facet, date and title. Article url is added to the selected fields automatically by Wandora. Article url is used as a subject identifier of an article topic.


Nytimes example 02.gif


Wandora user clicks the Extract button and Wandora asks user's api-key. Wandora keeps your api-key stored in memory during a use session. Your api-key is not stored between use sessions. If you need to change the api-key during a use session, press the Forget api-key button in query dialog. Wandora asks the key again once you perform an extraction.


Nytimes example 03.gif


If Wandora gets a valid response, the application parses response JSON and transforms included entities to topics and associations. One response JSON contains information about 10 articles. Wandora notices if there is more articles available and asks how to proceed.


Nytimes example 04.gif


Available options are

  • Do not extract any more pages.
  • Extract only next page
  • Extract next page
  • Extract 10 next pages
  • Extract all next pages

Wandora user should notice that the Article Search API usage is limited to 5000 requests per day and extracting one page takes one request. Thus, use option Extract all next pages carefully.

After successful extraction, Wandora user finds a topic New york Times API just below Wandora class. Opening the topic reveals extracted articles and used type topics.


Nytimes example 05.gif


Opening single article topic in Traditional topic panel views all information (i.e. selected fields) associated with the article.


Nytimes example 06.gif


For example, article body is modeled as an occurrence attached to the article topic.


Nytimes example 07.gif


By-lines are modeled as topics associated with the article topic. Similarly, facets are generally modeled as topics associated with the article topic.


Nytimes example 08.gif


All facet topics are typed with article-facet topic.


Nytimes example 09.gif


Opening one facet topic, say Letter, lists all articles that have the facet value.


Nytimes example 10.gif

See also

Once you have performed an article search using the New York Times Article Search API extractor, you might be interested in classifying articles:

Personal tools