
Data Mining for the Social Sciences: An Introduction

Full title: Data Mining for the Social Sciences: An Introduction
ISBN: 9780520280984
ISBN 10: 0520280989
Authors: Paul Attewell and David B. Monaghan, with Darren Kwong
Publisher: University of California Press
Edition: First
Num. pages: 264
Binding: Paperback
Language: en
Published on: 2015


Synopsis

We live, today, in a world of big data. The amount of information collected on human behavior every day is staggering, and exponentially greater than at any time in the past. At the same time, we are inundated by stories of powerful algorithms capable of churning through this sea of data and uncovering patterns. These techniques go by many names (data mining, predictive analytics, machine learning), and they are being used by governments as they spy on citizens and by huge corporations as they fine-tune their advertising strategies. And yet social scientists continue mainly to employ a set of analytical tools developed in an earlier era when data was sparse and difficult to come by. In this timely book, Paul Attewell and David Monaghan provide a simple and accessible introduction to data mining geared towards social scientists. They discuss how the data mining approach differs substantially, and in some ways radically, from that of conventional statistical modeling familiar to most social scientists. They demystify data mining, describing the diverse set of techniques that the term covers and discussing the strengths and weaknesses of the various approaches. Finally, they give practical demonstrations of how to carry out analyses using data mining tools in a number of statistical software packages. It is the hope of the authors that this book will empower social scientists to consider incorporating data mining methodologies in their analytical toolkits. (Provided by publisher.)

Contents: What Is Data Mining? -- Contrasts with the Conventional Statistical Approach -- Some General Strategies Used in Data Mining -- Important Stages in a Data Mining Project -- Preparing Training and Test Datasets -- Variable Selection Tools -- Creating New Variables Using Binning and Trees -- Extracting Variables -- Classifiers -- Classification Trees -- Neural Networks -- Clustering -- Latent Class Analysis and Mixture Models -- Association Rules.

Paul Attewell and David B. Monaghan, with Darren Kwong. Includes bibliographical references and index.