Research and Development in Intelligent Systems XXVI - Incorporating Applications and Innovations in Intelligent Systems XVII (Paperback, 2010 ed.)


R6,001

Product Description

The most common document formalisation for text classification is the vector space model founded on the bag of words/phrases representation. The main advantage of the vector space model is that it can readily be employed by classification algorithms. However, the bag of words/phrases representation is suited to capturing only word/phrase frequency; structural and semantic information is ignored. It has been established that structural information plays an important role in classification accuracy [14]. An alternative to the bag of words/phrases representation is a graph-based representation, which intuitively possesses much more expressive power. However, this representation introduces an additional level of complexity in that the calculation of the similarity between two graphs is significantly more computationally expensive than between two vectors (see for example [16]). Some work (see for example [12]) has been done on hybrid representations to capture both structural elements (using the graph model) and significant features using the vector model. However, the computational resources required to process this hybrid model are still extensive.

Product Details

General

Imprint

Springer London

Country of origin

United Kingdom

Release date

November 2009

Availability

Expected to ship within 10 - 15 working days

First published

2010

Dimensions

234 x 156 x 28mm (L x W x T)

Format

Paperback

Pages

504

Edition

2010 ed.

ISBN-13

978-1-84882-982-4

Barcode

9781848829824

LSN

1-84882-982-5


