MakeSIWithTextdata
Generating topic maps with extractors, or otherwise automatically, may result in topics that contain important identifier data in occurrences (text data). To clean up such topics you may need to move this identifier data to subject identifiers, where it behaves more like an identifier.
The tool enables the user to create subject identifiers for context topics using selected text data. When executed, the tool first asks for the type of the text data to use. Then it asks for the scope of the text data. Scope refers to the language of the text data. Finally the tool requests the URI body for the subject identifier. The default URI body is
http://www.wandora.net/defaultSI/%OCCURRENCE%
The tool replaces %OCCURRENCE% in the URI body with the topic's text data. All special characters, such as the newline character, are discarded from the created URI before it is added to the topic as a subject identifier.
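The substitution described above can be sketched roughly as follows. This is only an illustration and not Wandora's actual code: the function name make_subject_identifier is hypothetical, and the sketch assumes that "special characters" means ASCII control characters (newline, carriage return, tab and similar), which the page does not define precisely.

```python
DEFAULT_URI_BODY = "http://www.wandora.net/defaultSI/%OCCURRENCE%"


def make_subject_identifier(text_data, uri_body=DEFAULT_URI_BODY):
    """Build a subject identifier URI from a topic's text data (hypothetical helper)."""
    # Put the text data in place of the %OCCURRENCE% placeholder.
    uri = uri_body.replace("%OCCURRENCE%", text_data)
    # Assumption: only ASCII control characters (code points below 32, and DEL)
    # are treated as "special" and dropped.
    return "".join(ch for ch in uri if ord(ch) >= 32 and ord(ch) != 127)


if __name__ == "__main__":
    print(make_subject_identifier("ISBN-123\n"))
```

Under these assumptions a trailing newline in the text data does not end up in the resulting URI.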
Note also that generating subject identifiers from identical text data causes topics to merge. By exploiting this, the tool can be used to collapse a group of topics into a single topic. The tool is aware of a potential subject identifier explosion: after adding the SI it checks whether the topic has more than 10 subject identifiers and removes the extra identifiers if the user so chooses.
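The merge effect and the identifier limit can be illustrated with a small sketch. It is not Wandora's implementation: the names group_by_subject_identifier and trim_subject_identifiers are hypothetical, topics are modelled as plain strings, and the sketch assumes the first 10 identifiers are the ones kept, since the page does not say which identifiers the tool removes.

```python
MAX_SUBJECT_IDENTIFIERS = 10  # limit mentioned in the text above


def group_by_subject_identifier(topic_to_si):
    """Group topic names by generated subject identifier.

    Topics that share an identifier would merge into one topic.
    """
    groups = {}
    for topic, si in topic_to_si.items():
        groups.setdefault(si, []).append(topic)
    return groups


def trim_subject_identifiers(identifiers, limit=MAX_SUBJECT_IDENTIFIERS):
    """Keep at most `limit` identifiers (assumption: the earliest ones are kept)."""
    return list(identifiers)[:limit]


if __name__ == "__main__":
    generated = {
        "topic-a": "http://www.wandora.net/defaultSI/ISBN-123",
        "topic-b": "http://www.wandora.net/defaultSI/ISBN-123",
        "topic-c": "http://www.wandora.net/defaultSI/ISBN-456",
    }
    for si, topics in group_by_subject_identifier(generated).items():
        print(si, topics)
```

Here topic-a and topic-b receive the same identifier, so in a topic map they would collapse into a single topic, while topic-c stays separate.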
GUI name
- Make SI with text data