Mining Distance-Constrained Embedded Subtrees.
In: Mining of Data with Complex Structures; 2010, p175-190, 16p
Book
In some applications, the distance between nodes in a hierarchical structure is significant, and two embedded subtrees whose nodes have different distance relationships need to be treated as separate entities. Embedded subtrees extracted under the traditional definition cannot be distinguished further on the basis of node distances within the subtree. In this chapter, we describe an extension of the general TMG framework that enables the mining of distance-constrained embedded subtrees (Hadzic 2008; Tan 2008). For such subtrees, the distance of each node from the root of the subtree must be taken into account during the candidate enumeration phase: these distances (node depths) are stored and used as an additional equality criterion when grouping the enumerated candidate subtrees. In Chapter 9, we will illustrate scenarios and applications where mining distance-constrained embedded subtrees is preferable to mining traditional embedded subtrees because the extracted patterns are more informative. We also highlight the importance of distance-constrained subtree mining for web log mining, where the web logs are represented in tree-structured form. In what follows, we discuss the importance of distance-constrained embedded subtrees from a more general perspective and relate it to previous work on extracting tree-structured queries. [ABSTRACT FROM AUTHOR]
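The grouping idea described in the abstract can be illustrated with a minimal Python sketch. It is not the TMG framework's implementation: the occurrence representation (a pre-order list of `(label, parent_index, depth)` tuples) and the function names `traditional_key` and `distance_key` are assumptions made purely for illustration. The sketch shows how adding node depth to the equality criterion splits occurrences that the traditional definition would count as the same embedded subtree.

```python
from collections import Counter

# Hypothetical representation of one occurrence of an embedded subtree:
# a pre-order list of (label, parent_index, depth) tuples, where
#   parent_index is the position of the node's parent within the occurrence
#   (-1 for the subtree root), and
#   depth is the node's distance from the subtree root in the original tree.

def traditional_key(occurrence):
    """Grouping key that ignores node distances (labels and ancestry only)."""
    return tuple((label, parent) for label, parent, _depth in occurrence)

def distance_key(occurrence):
    """Grouping key that also requires equal depths relative to the subtree root."""
    return tuple(occurrence)

# Three occurrences of the embedded subtree "a with descendant c".
occurrences = [
    [("a", -1, 0), ("c", 0, 1)],  # c is a child of a
    [("a", -1, 0), ("c", 0, 2)],  # c is a grandchild of a
    [("a", -1, 0), ("c", 0, 2)],  # c is a grandchild of a (another tree)
]

# Count occurrences per candidate under each equality criterion.
traditional_groups = Counter(traditional_key(o) for o in occurrences)
constrained_groups = Counter(distance_key(o) for o in occurrences)

print(len(traditional_groups), len(constrained_groups))
```

Under the traditional key all three occurrences fall into one candidate group, while the distance-aware key separates the child case from the grandchild case.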
Copyright of Mining of Data with Complex Structures is the property of Springer Nature / Books.
Title: | Mining Distance-Constrained Embedded Subtrees. |
---|---|
Author / Contributor: | Hadzic, Fedja ; Tan, Henry ; Dillon, Tharam S. |
Source: | Mining of Data with Complex Structures; 2010, p175-190, 16p |
Published: | 2010 |
Media type: | Book |
ISBN: | 978-3-642-17556-5 (print) |
DOI: | 10.1007/978-3-642-17557-2_7 |