HTML property table extractor

From WandoraWiki
(Difference between revisions)
Jump to: navigation, search
Line 19: Line 19:
  
 
When this table is processed with Property Table Extractor, it is dragged to Wandora's Drag and Drop extractor for example, an association results. This association has type '''Republic of Finland''' and contains four players '''Helsinki''', '''Parliamentary democracy''', '''338,145 km2''', and '''5,297,316'''. Respectively roles of players are '''Capital''', '''Goverment''', '''Area''', and '''Population'''. If you look Wandora, the association should look like this  
 
When this table is processed with Property Table Extractor, it is dragged to Wandora's Drag and Drop extractor for example, an association results. This association has type '''Republic of Finland''' and contains four players '''Helsinki''', '''Parliamentary democracy''', '''338,145 km2''', and '''5,297,316'''. Respectively roles of players are '''Capital''', '''Goverment''', '''Area''', and '''Population'''. If you look Wandora, the association should look like this  
 
  
 
[[Image:Property_table_extractor_example.gif|center]]
 
[[Image:Property_table_extractor_example.gif|center]]
 +
 +
The Property table extractor enables easy screen scraping method as HTML fragments may be copied directly from WWW browser such as Firefox.

Revision as of 19:42, 5 November 2007

Property table extractor reads simple HTML fragments containing table, tr, and td elements and converts table content to single association. Extractor expects the table contains only two columns where first column contains property name and second the property value. Each property and property value are considered as an association role and association player of same association. If first line does not contain property value the line is considered as a association type. For example consider rather typical table

Republic of Finland
Capital Helsinki
Government Parliamentary democracy
Area 338,145 km2
Population 5,297,316

When this table is processed with Property Table Extractor, it is dragged to Wandora's Drag and Drop extractor for example, an association results. This association has type Republic of Finland and contains four players Helsinki, Parliamentary democracy, 338,145 km2, and 5,297,316. Respectively roles of players are Capital, Goverment, Area, and Population. If you look Wandora, the association should look like this

Property table extractor example.gif

The Property table extractor enables easy screen scraping method as HTML fragments may be copied directly from WWW browser such as Firefox.

Personal tools