by akivela » Thu Oct 27, 2011 11:00 am
Hi Gimley
Once you have successfully configured Wandora's Stanford NER extractor by addressing the SER file, you can perform Stanford NER extractions following next simple steps:
1. Select menu option File > Extract > Classification > Stanford Named Entity Recognizer. When selected, Wandora opens a dialog window.
2. In the dialog window, select a tab that reflects your data source. Available data sources are Files, Urls and Raw. If you just want to test drive the classifier, I suggest you select the Raw tab.
3. If you selected the Raw tab, write (or copy and paste) some English text into the text field. If you selected Urls tab, write some url addresses to the text field. If you selected Files tab, select some text files to be classified.
4. Once you have addressed the source data, press Extract button. Now Wandora reads your data and performs classifications using the Stanford Named Entity Recognizer.
5. Once the extraction/classification is finished, Wandora views a log window. The log window contains the number of found entities. Close the log window.
6. If the extraction/classification was successful, Wandora should contain new topics and associations now. You can view the topics in Wandora's topic tree. Topic tree locates left, under the Topics tab. Tree root labeled as Wandora Class has now two new branches labeled as Document and Stanford NER. You'll find all data source documents under the Document and all found entity topics under the Stanford NER. Each data source document topic has associations that link the document to all recognized entities.
I hope you'll be able to perform successful classifications following these instructions. Notice, Wandora can also extract entities out of occurrences but that's another use case. Face still problems, please drop a line.
Kind Regards,
Aki / Wandora Team
Last edited by
akivela on Thu Oct 27, 2011 4:07 pm, edited 1 time in total.