HTML property table extractor

From WandoraWiki
(Difference between revisions)
Jump to: navigation, search
 
(10 intermediate revisions by one user not shown)
Line 1: Line 1:
Property table extractor reads simple HTML fragments containing '''table''', '''tr''', and '''td''' elements and converts table content to single association. Extractor expects the table contains only two columns where first column contains property name and second the property value. Each property and property value are considered as an association role and association player of same association. If first line does not contain property value the line is considered as a association type. For example consider rather typical table
+
Property table extractor reads simple HTML fragments containing '''table''', '''tr''', and '''td''' elements and converts table content to single association. Extractor expects the table contains only two columns where first column contains property name and second the property value. Each property and property value are considered as an association role and association player of same association. If first line does not contain property value the line is considered as association type. For example consider rather typical table
  
{| cellspacing="0" cellpadding="5" width="100%" background-color="transparent" border="1"
+
 
 +
{| cellspacing="0" cellpadding="5" width="60%" background-color="transparent" border="1" align="center"
 
|-
 
|-
 
| Republic of Finland
 
| Republic of Finland
Line 18: Line 19:
 
|}
 
|}
  
When this table is processed with Property Table Extractor, it is dragged to Wandora's Drag and Drop extractor for example, an association results. This association has type '''Republic of Finland''' and contains four players '''Helsinki''', '''Parliamentary democracy''', '''338,145 km2''', and '''5,297,316'''. Respectively roles of players are '''Capital''', '''Goverment''', '''Area''', and '''Population'''. If you look Wandora, the association should look like this
 
  
 +
When this table is processed with Property Table Extractor, it is dragged to Wandora's Drag and Drop extractor for example, an association results. This association has type '''Republic of Finland''' and contains four players '''Helsinki''', '''Parliamentary democracy''', '''338,145 km2''', and '''5,297,316'''. Respectively roles of players are '''Capital''', '''Goverment''', '''Area''', and '''Population'''. If you look at Wandora, the association looks like this
  
 
[[Image:Property_table_extractor_example.gif|center]]
 
[[Image:Property_table_extractor_example.gif|center]]
 +
 +
The Property table extractor enables easy screen scraping method as HTML fragments may be dragged directly from WWW browser such as Firefox. See also [[HTML association table extractor]] for a slightly different HTML table extraction method.

Latest revision as of 20:46, 22 January 2008

Property table extractor reads simple HTML fragments containing table, tr, and td elements and converts table content to single association. Extractor expects the table contains only two columns where first column contains property name and second the property value. Each property and property value are considered as an association role and association player of same association. If first line does not contain property value the line is considered as association type. For example consider rather typical table


Republic of Finland
Capital Helsinki
Government Parliamentary democracy
Area 338,145 km2
Population 5,297,316


When this table is processed with Property Table Extractor, it is dragged to Wandora's Drag and Drop extractor for example, an association results. This association has type Republic of Finland and contains four players Helsinki, Parliamentary democracy, 338,145 km2, and 5,297,316. Respectively roles of players are Capital, Goverment, Area, and Population. If you look at Wandora, the association looks like this

Property table extractor example.gif

The Property table extractor enables easy screen scraping method as HTML fragments may be dragged directly from WWW browser such as Firefox. See also HTML association table extractor for a slightly different HTML table extraction method.

Personal tools