Tag: data mining

  • Fundamentals of Data Mining

    Fundamentals of Data Mining

    Data mining is the process of discovering these patterns among the data and is therefore also known as Knowledge Discovery from Data (KDD).

  • Free Book, Mining Massive Datasets, 2nd Edition

    A new edition of Mining Massive Datasets is now available. It is used for a number of data mining courses at colleges across the US (and globe). Here are just a few of the topics from the book. Map-reduce Clustering Recommendation Systems Dimensionality Reduction Social Network Analysis

  • Data Mining and Analysis Textbook (Free Download)

    Mohammed J. Zaki, Computer Science Professor at RPI, and Wagner Meira Jr., Computer Science Professor at Universidade Federal de Minas Gerais, have written the textbook Data Mining and Analysis: Fundamental Concepts and Algorithms. The book is currently available as a PDF download. Based upon the chapters, the book looks very good. It contains large sections…

  • Data Mining MOOC

    The University of Waikato in New Zealand will be offering a free online course titled, Data Mining with Weka. Weka is a widely-used toolkit for data mining and machine learning. The University of Waikato developed the toolkit. Don’t wait too long to sign up, the course starts September 9, 2013. Here is a video of…

  • Data Mining Standard Processes

    There are a couple of standard processes for approaching data mining problems. CRISP-DM The most common approach is Cross Industry Standard Process for Data Mining (CRISP-DM). Steps of CRISP-DM Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment The steps are mostly self-explanatory, but the CRISP-DM wikipedia page has a lengthier description. SEMMA The second…

  • Win-Vector Blog » Data Science, Machine Learning, and Statistics: what is in a name?

    Win-Vector Blog » Data Science, Machine Learning, and Statistics: what is in a name?. This is an excellent write-up for the differences between: Statistics Machine Learning Data Mining Informatics Big Data Predictive Analytics Data Science

  • Best Free Data Mining Tools

    I recently saw the article, The Best Data Mining Tools You Can Use for Free in Your Company. It contains a very brief description of each of the following tools. RapidMiner RapidAnalytics Weka PSPP KNIME Orange Apache Mahout jHepWork Rattle See The Best Data Mining Tools You Can Use for Free in Your Company for…

  • Data Mining Map

    Dr. Saed Sayad, a Professor at the University of Toronto, has created a great diagram about the elements of Data Mining. It is map (mathematical tree structure) that shows many of the common techniques in data mining and when to apply each one. Note: you can click on individual elements in the map for more…

  • Programmer’s Guide to Data Mining – A free ebook

    Ron Zacharski is currently writing a data mining book, A Programmer’s Guide to Data Mining. The book is targeted at programmers that want to know when and how to apply recommendation engines and other data mining techniques. The book is still in the writing phase, but I can say the first couple chapters are excellent.…

  • Top ten algorithms in data mining (2007) [pdf] | Hacker News

    Top ten algorithms in data mining (2007) [pdf] | Hacker News. The discussion below the link is also very good. If you are curious, here are the 10 algorithms, and the paper is displayed below. C4.5 k-Means SVM Apriori EM PageRank AdaBoost kNN Naive Bayes CART

  • Free Textbook: Mining of Massive Datasets

    A few professors from Stanford University have released version 1.1 of their textbook, Mining of Massive Datasets. The book has been created from materials used for a couple of Stanford computer science classes including large-scale data-mining and web mining. The book looks excellent and really focuses on the analysis of data at a large scale.…