Your cart is empty
"Data Mining: Practical Machine Learning Tools and Techniques" offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Thorough updates reflect the technical changes and
modernizations that have taken place in the field since the last
edition, including new material on Data Transformations, Ensemble
Learning, Massive Data Sets, Multi-instance Learning, plus a new
version of the popular Weka machine learning software developed by
the authors. Witten, Frank, and Hall include both tried-and-true
techniques of today as well as methods at the leading edge of
*Provides a thorough grounding in machine learning concepts as well as practical advice on applying the tools and techniques to your data mining projects *Offers concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods *Includes downloadable Weka software toolkit, a collection of machine learning algorithms for data mining tasks-in an updated, interactive interface. Algorithms in toolkit cover: data pre-processing, classification, regression, clustering, association rules, visualization
The increasing volume of data in modern business and science calls for more complex and sophisticated tools. Although advances in data mining technology have made extensive data collection much easier, it s still always evolving and there is a constant need for new techniques and tools that can help us transform this data into useful information and knowledge.
Since the previous edition s publication, great advances have
been made in the field of data mining. Not only does the third of
edition of "Data Mining: Concepts and Techniques" continue the
tradition of equipping you with an understanding and application of
the theory and practice of discovering patterns hidden in large
data sets, it also focuses on new, important topics in the field:
data warehouses and data cube technology, mining stream, mining
social networks, and mining spatial, multimedia and other complex
data. Each chapter is a stand-alone guide to a critical topic,
presenting proven algorithms and sound implementations ready to be
used directly or with strategic modification against live data.
This is the resource you need if you want to apply today s most
powerful data mining techniques to meet real business
* Presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects. * Addresses advanced topics such as mining object-relational databases, spatial databases, multimedia databases, time-series databases, text databases, the World Wide Web, and applications in several fields. *Provides a comprehensive, practical look at the concepts and techniques you need to get the most out of your data"
The demand for SQL information and training continues to grow
with the need for a database behind every website capable of
offering web-based information queries. SQL is the de facto
standard for database retrieval, and if you need to access, update,
or utilize data in a modern database management system, youwill
need SQL to do it. TheSecond Editionof "Joe Celko's Trees and
Hierarchies in SQL for Smarties" covers two new sets of extensions
over three entirelynew chapters and expounds upon the changes that
have occurred in SQL standards since the previous edition's
publication. Benefit from mastering the challenging aspects of
these database applications in SQL as taught by Joe Celko, one of
the most-read SQL authors in the world.
*Expert advice from a noted SQL authority and award-winning columnist who has given 10 years of service to the ANSI SQL standards committee
*Teaches scores of advanced techniques that can be used with any product, in any SQL environment
*Offers graph theory and programming techniques for working around deficiencies and gives insight into real-world challenges"
Information Modeling and Relational Databases, second edition,
provides an introduction to ORM (Object-Role Modeling)and much
more. In fact, it is the only book to go beyond introductory
coverage and provide all of the in-depth instruction you need to
transform knowledge from domain experts into a sound database
design. This book is intended for anyone with a stake in the
accuracy and efficacy of databases: systems analysts, information
modelers, database designers and administrators, and programmers.
Making the move from an IT technician or team member to management is one of the most difficult career steps you ll face. Help from management and targeted training can be hard to come by - and your success depends on your ability to adapt to your new role almost overnight. You might have years of experience in the trenches, but you ll quickly find that managing a team, setting budgets, and creating a winning strategy for the first time can be daunting tasks.
Now in its third edition, "IT Manager s Handbook" provides a
practical reference that you will return to again and again in an
ever-changing corporate environment where the demands on IT
continue to increase. Make your first 100 days really count with
the fundamental principles and core concepts critical to your
success as a new IT Manager. The book also includes discusses how
to develop an overall IT strategy as well as demonstrate the value
of IT to the company.
In this book, you ll learn how to: Manage your enterprise s new level of connectivity with a NEW chapter covering social media, handheld devices, and moreImplement and optimize cloud services to provide a better experience for your mobile and virtual workforce at a lower cost to your bottom lineIntegrate mobile applications into your company s strategyManage the money, including topics such as department budgets and leasing versus buyingWork with your "customers," whomever those might be for your IT shopHire, train, and manage your team and their projects so that you come in on time and budgetSecure your systems to face some of today's most challenging security challenges"
This book brings all of the elements of database design together in
a single volume, saving the reader the time and expense of making
multiple purchases. It consolidates both introductory and advanced
topics, thereby covering the gamut of database design methodology ?
from ER and UML techniques, to conceptual data modeling and table
transformation, to storing XML and querying moving objects
Joe Celkos SQL for Smarties: Advanced SQL Programming offers tips and techniques in advanced programming. This book is the fourth edition and it consists of 39 chapters, starting with a comparison between databases and file systems. It covers transactions and currency control, schema level objects, locating data and schema numbers, base tables, and auxiliary tables. Furthermore, procedural, semi-procedural, and declarative programming are explored in this book. The book also presents the different normal forms in database normalization, including the first, second, third, fourth, fifth, elementary key, domain-key, and Boyce-Codd normal forms. It also offers practical hints for normalization and denormalization. The book discusses different data types, such as the numeric, temporal and character data types; the different predicates; and the simple and advanced SELECT statements. In addition, the book presents virtual tables, and it discusses data partitions in queries; grouping operations; simple aggregate functions; and descriptive statistics, matrices and graphs in SQL. The book concludes with a discussion about optimizing SQL. It will be of great value to SQL programmers. KEY FEATURES * Expert advice from a noted SQL authority and award-winning columnist who has given ten years service to the ANSI SQL standards committee * Teaches scores of advanced techniques that can be used with any product, in any SQL environment, whether it is an SQL 92 or SQL 2008 environment * Offers tips for working around deficiencies and gives insight into real-world challenges
Perfectly intelligent programmers often struggle when forced to
work with SQL. Why? Joe Celko believes the problem lies with their
procedural programming mindset, which keeps them from taking full
advantage of the power of declarative languages. The result is
overly complex and inefficient code, not to mention lost
In this complete revision and expansion of his first SQL Puzzles
book, Joe Celko challenges you with his trickiest puzzles and then
helps solve them with a variety of solutions and explanations. Joe
demonstrates the thought processes that are involved in attacking a
problem from an SQL perspective to help advanced database
programmers solve the puzzles you frequently face. These techniques
not only help with the puzzle at hand, but help develop the mindset
needed to solve the many difficult SQL puzzles you face every day.
Of course, part of the fun is to see whether or not you can write
better solutions than Joe s.
Principles of Transaction Processing is a comprehensive guide to developing applications, designing systems, and evaluating engineering products. The book provides detailed discussions of the internal workings of transaction processing systems, and it discusses how these systems work and how best to utilize them. It covers the architecture of Web Application Servers and transactional communication paradigms. The book is divided into 11 chapters, which cover the following: Overview of transaction processing application and system structureSoftware abstractions found in transaction processing systemsArchitecture of multitier applications and the functions of transactional middleware and database serversQueued transaction processing and its internals, with IBM's Websphere MQ and Oracle's Stream AQ as examplesBusiness process management and its mechanismsDescription of the two-phase locking function, B-tree locking and multigranularity locking used in SQL database systems and nested transaction lockingSystem recovery and its failuresTwo-phase commit protocolComparison between the tradeoffs of replicating servers versus replication resourcesTransactional middleware products and standardsFuture trends, such as cloud computing platforms, composing scalable systems using distributed computing components, the use of flash storage to replace disks and data streams from sensor devices as a source of transaction requests. The text meets the needs of systems professionals, such as IT application programmers who construct TP applications, application analysts, and product developers. The book will also be invaluable to students and novices in application programming.
"Data Modeling Essentials, Third Edition" provides expert tutelage
for data modelers, business analysts and systems designers at all
levels. Beginning with the basics, this book provides a thorough
grounding in theory before guiding the reader through the various
stages of applied data modeling and database design. Later chapters
address advanced subjects, including business rules, data
warehousing, enterprise-wide modeling and data management.
Data analysis for database design is a subject of great practical
value to systems analysts and designers. This classic text has been
updated to include chapters on distributed database systems, query
optimisation and object-orientation.The SQL content now includes
features of SQL92 and SQL 99.
The potential business advantages of data mining are well
documented in publications for executives and managers. However,
developers implementing major data-mining systems need concrete
information about the underlying technical principles and their
practical manifestations in order to either integrate commercially
available tools or write data-mining programs from scratch. This
book is the first technical guide to provide a complete,
generalized roadmap for developing data-mining applications,
together with advice on performing these large-scale, open-ended
analyses for real-world data warehouses.
"How to Build a Digital Library" is the only book that offers
all the knowledge and tools needed to construct and maintain a
digital library, regardless of the size or purpose. It is the
perfectly self-contained resource for individuals, agencies, and
institutions wishing to put this powerful tool to work in their
burgeoning information treasuries. The Second Edition reflects new
developments in the field as well as in the Greenstone Digital
Library open source software. In Part I, the authors have added an
entire new chapter on user groups, user support, collaborative
browsing, user contributions, and so on. There is also new material
on content-based queries, map-based queries, cross-media queries.
There is an increased emphasis placed on multimedia by adding a
"digitizing" section to each major media type. A new chapter has
also been added on "internationalization," which will address
Unicode standards, multi-language interfaces and collections, and
issues with non-European languages (Chinese, Hindi, etc.). Part II,
the software tools section, has been completely rewritten to
reflect the new developments in Greenstone Digital Library
Software, an internationally popular open source software tool with
a comprehensive graphical facility for creating and maintaining
digital libraries. As with the First Edition, a web site,
implemented as a digital library, will accompany the book and
provide access to color versions of all figures, two online
appendices, a full-text sentence-level index, and an automatically
generated glossary of acronyms and their definitions. In addition,
demonstration digital library collections will be included to
demonstrate particular points in the book. to access the online
content please visit, http: //www.greenstone.org/howto
*Outlines the history of libraries-- both traditional and digital-- and their impact on present practices and future directions. *Written for both technical and non-technical audiences and covers the entire spectrum of media, including text, images, audio, video, and related XML standards. *Web-enhanced with software documentation, color illustrations, full-text index, source code, and more."
How do you approach answering queries when your data is stored in multiple databases that were designed independently by different people? This is first comprehensive book on data integration and is written by three of the most respected experts in the field.
This book provides an extensive introduction to the theory and concepts underlying today's data integration techniques, with detailed, instruction for their application using concrete examples throughout to explain the concepts. Data integration is the problem of answering queries that span multiple data sources (e.g., databases, web pages). Data integration problems surface in multiple contexts, including enterprise information integration, query processing on the Web, coordination between government agencies and collaboration between scientists. In some cases, data integration is the key bottleneck to making progress in a field.
The authors provide a working knowledge of data integration
concepts and techniques, giving you the tools you need to develop a
complete and concise package of algorithms and applications.
*Offers a range of data integration solutions enabling you to focus on what is most relevant to the problem at hand.
*Enables you to build your own algorithms and implement your own data integration applications
*Companion website with numerous project-based exercises and solutions and slides. Links to commercially available software allowing readers to build their own algorithms and implement their own data integration applications. Facebook page for reader input during and after publication.
Data is an expensive and expansive asset. Information capture has forced storage capacity from megabytes to terabytes, exabytes and, pretty soon, zetabytes of data. So the need for accessible storage space for this data is great. To make this huge amount of data usable and relevant, it needs to be organized effectively. Database Base Management Systems, such as Oracle, IBM s DB2, and Microsoft SqlServer are used often, but these are being enhanced continuously and auxiliary tools are being developed every week; there needs to be a fundamental starting point for it all. That stating point is Data Architecture, the blueprint for organizing and structuring of information for services, service providers, and the consumers of that data.
"Data Architecture: From Zen to Reality" explains the principles underlying data architecture, how data evolves with organizations, and the challenges organizations face in structuring and managing their data. It also discusses proven methods and technologies to solve the complex issues dealing with data. The book uses a holistic approach to the field of data architecture by covering the various applied areas of data, including data modelling and data model management, data quality, data governance, enterprise information management, database design, data warehousing, and warehouse design. This book is a core resource for anyone emplacing, customizing or aligning data management systems, taking the Zen-like idea of data architecture to an attainable reality.
Presents fundamental concepts of enterprise architecture with definitions and real-world applications and scenariosTeaches data managers and planners about the challenges of building a data architecture roadmap, structuring the right team, and building a long term set of solutions Includes the detail needed to illustrate how the fundamental principles are used in current business practice"
Fuzzy Modeling and Genetic Algorithms for Data Mining and
Exploration is a handbook for analysts, engineers, and managers
involved in developing data mining models in business and
government. As you'll discover, fuzzy systems are extraordinarily
valuable tools for representing and manipulating all kinds of data,
and genetic algorithms and evolutionary programming techniques
drawn from biology provide the most effective means for designing
and tuning these systems.
This isn't a book about the Object Data Standard; it's the
When it comes to storing objects in databases, ODMG 3.0 is
The definitive book on Oracle's Rdb database.
Do you need an introductory book on data and databases? If the
book is by Joe Celko, the answer is yes. "Data and Databases:
Concepts in Practice" is the first introduction to relational
database technology written especially for practicing IT
professionals. If you work mostly outside the database world, this
book will ground you in the concepts and overall framework you must
master if your data-intensive projects are to be successful. If
you're already an experienced database programmer, administrator,
analyst, or user, it will let you take a step back from your work
and examine the founding principles on which you rely every
day-helping you to work smarter, faster, and problem-free.
Whatever your field or level of expertise, Data and Databases
offers you the depth and breadth of vision for which Celko is
famous. No one knows the topic as well as he, and no one conveys
this knowledge as clearly, as effectively-or as engagingly. Filled
with absorbing war stories and no-holds-barred commentary, this is
a book you'll pick up again and again, both for the information it
holds and for the distinctive style that marks it as genuine
DB2 Universal Database (UDB) supports many different types of
applications, on many different kinds of data, in many different
software and hardware environments.
This book provides a complete guide to DB2 UDB Version 5 in all
its aspects, including the interfaces that support end users,
application developers, and database administrators. It is
complementary to the IBM product documentation, providing a clear
and informal explanation of how the features of DB2 were intended
to be used. It is an extensive revision of the author's earlier
book, "Using the New DB2: IBM's Object-Relational Database
The aim of query processing is to find information in one or
more databases and deliver it to the user quickly and efficiently.
Traditional techniques work well for databases with standard,
single-site relational structures, but databases containing more
complex and diverse types of data demand new query processing and
Most real-world data is not well structured. Today's databases
typically contain much non-structured data such as text, images,
video, and audio, often distributed across computer networks. In
this complex milieu
Principles of Database Query Processing for Advanced
Applications teaches the basic concepts and techniques of query
processing and optimization for a variety of data forms and
database systems, whether structured or unstructured.
You may like...
Web Server Technology
Nancy J Yeager, Robert E. McGrath Paperback R1,829 Discovery Miles 18 290
Relational Database Systems
Dan A. Simovici, Richard L. Tenney Hardcover R1,536 Discovery Miles 15 360
Atomic Transactions - In Concurrent and…
Nancy A. Lynch, Michael Merritt, … Hardcover R2,429 Discovery Miles 24 290
Query Processing for Advanced Database…
Johann Christoph Freytag, David Maier, … Hardcover R2,685 Discovery Miles 26 850
Understanding the New SQL - A Complete…
Jim Melton, Alan R. Simon Paperback R1,724 Discovery Miles 17 240
Building an Object-Oriented Database…
Francois Bancilhon, Claude Delobel, … Hardcover R2,699 Discovery Miles 26 990
Database Transaction Models for Advanced…
Ahmed K. Elmagarmid Hardcover R2,923 Discovery Miles 29 230
Dictionary of Information Science and…
Carolyn Watters Hardcover R1,518 Discovery Miles 15 180
Proceedings 1990 VLDB Conference - 16th…
Vldb Paperback R1,513 Discovery Miles 15 130
Database Programming Languages 2nd
Richard Hull, David Stemple, … Paperback R1,674 Discovery Miles 16 740