Tag: nosql

  • Huge List of Big Data and Machine Learning Technologies

    Onur Akpolat has put together A curated list of awesome big data frameworks and resources. The list is very extensive and includes: NoSQL databases, machine learning libraries, frameworks, filesystems and more. On a similar note, Joseph Misiti has compiled a large list of machine learning specific resources. The list is titled, Awesome Machine Learning, and…

  • 7 Important Data Science Papers

    7 Important Data Science Papers

    It is back-to-school time, and here are some papers to keep you busy this school year. All the papers are free. This list is far from exhaustive, but these are some important papers in data science and big data. Google Search PageRank – This is the paper that explains the algorithm behind Google search. Hadoop…

  • Nice GraphDB and NoSQL Talk

    This is a wonderful talk by Max DeMarzi (he has a very informative blog as well). If you are new to NoSQL or Graph Databases, I highly recommend this video. One comment stuck out for me: You’re never gonna run out of nodes when you get to half a trillion… That is a really big…

  • 2 Recently Released Open Source Graph-Related Projects

    GraphBuilder Intel Labs built a tool for constructing mathematical graphs out of large datasets. It is Java based and works with Hadoop and MapReduce. Intel has release a whitepaper explaining more about GraphBuilder. The code is available on Github. A big thanks to Mark Nickel for pointing out this project. ArangoDB ArangoDB is a flexible…

  • 3 Secrets for Aspiring Data Scientists | Software Advice

    Michael Koploy wrote 3 Secrets for Aspiring Data Scientists about what it takes to enter a career as a data scientist. He lays out 3 steps: Sharpen Your Scientific Saw – Hone your math and science skills Learn the Language of Business – Data Scientists need to explain the data in business terms Keep Adding…

  • Java and MongoDB Webinars

    10gen, the company behind MongoDB, will be offering some free webinars this fall. This webinar series is targeted at using MongoDB with Java. 10gen has been running successful webinars for a long time, so I would high recommend any/all of the following sessions. Title Date Building your first Java Application with MongoDB Oct. 18, 2012…

  • A Comparison of NoSQL Offerings

    Kristof Kovacs put together an excellent comparisons of the different NoSQL products. Here are the products it covers. MongoDB Riak CouchDB Redis HBase Neo4j Cassandra Membase

  • Neo4j and Bioinformatics Webinar

    Neo Technology, the company behind the graph database Neo4j, is hosting a webinar on Thursday. Pablo Pareja from the Bio4j project will provide an overview of bioinformatics and neo4j, as well as some applications. Bioinformatics can be viewed as data science for biology. Bioinformatics was cool before data science was even a term. If you…

  • Challenge To Future Developers: Start Storing More Data

    Dear Future Developers Please store as much data as possible. Do not worry about the cost of the extra storage disks. The value in the data will far outweigh the cost of the hardware. Here are some examples of data that could be stored but is typically not. Start storing data about the order in…

  • Twitter, NoSQL and Data Analysis

    This is a lengthy but very good slide deck on the what/why of the tools used at Twitter. Note: The slide deck is about 2 years old. NoSQL at Twitter (NoSQL EU 2010) View more presentations from Kevin Weil