Tag: open source

  • Quantum Computing with @hellodavidryan: TDI 24

    “Within ten years there will be everything from small form factor QPUs doing 10 to 30 qubits as a part of roaming networks of devices, through to crazy large quantum devices doing thousands of qubits” — David (@hellodavidryan) on Threads Today we have @hellodavidryan. How did you end up working in the Quantum Computing space?…

  • Open Source with @bendotcodes: TDI 14

    “we are stronger if we all share some of our works together than if we all kept everything to ourself.” — ben.codes (@bendotcodes) on Threads Today we have @bendotcodes. We are going to start out talking about open source software because that is a topic not yet covered in these interviews. In case someone new…

  • Do’s and Don’ts of Data Science

    Don’t Start with the Data Do Start with a Good Question Don’t think one person can do it all Do build a well-rounded team Don’t only use one tool Do use the best tool for the job Don’t brag about the size of your data Do collect relevant data Don’t ignore domain knowledge Do consult…

  • Dat – Version Controlled Data

    Dat is an open source project focusing on data storage. In particular, the project wants to version control data. What is version control? In short it allows for tracking of history associated with something (typically source code files or documents). Dat takes the idea a bit further, and the data is versioned at the row…

  • The New Open Data Handbook

    Originally published in 2012, the Open Data Handbook has released an second edition. The handbook is to be used as a guide for organizations or individuals interested in publishing and/or utilizing open data. The goal is ensuring data is open and that data is applied as often as possible. The second edition now includes 3…

  • Open Data Day 2015

    Today, February 21, 2015 is Open Data Day. What is it? Around the globe, cities are hosting hackathons centered around open data. The rules are fairly open-ended as long as the event is open and uses open data. Who is it for? Designers Developers Statisticians Librarians Citizens If you want to get involved, check the…

  • Open Source Alternatives to AWS

    Working with big data can often mean doing some cloud computing. If a public cloud like Amazon AWS is not an option, there are some open source alternatives. They all offer some level of compatibility with the AWS API for both EC2(compute) and S3(storage). Rackspace OpenStack Apache CloudStack Eucalyptus OpenNubula

  • Best Free Data Mining Tools

    I recently saw the article, The Best Data Mining Tools You Can Use for Free in Your Company. It contains a very brief description of each of the following tools. RapidMiner RapidAnalytics Weka PSPP KNIME Orange Apache Mahout jHepWork Rattle See The Best Data Mining Tools You Can Use for Free in Your Company for…

  • 50 Top Open Source Tools for Big Data – Datamation

    50 Top Open Source Tools for Big Data – Datamation. The list is about 6 months old, but it still covers all the ones I would have listed and quite a few more.

  • Big Data Right Now: Five Trendy Open Source Technologies | TechCrunch

    Open Source Software can be great. TechCruch lists 5 fairly new open source technologies for big data. This is probably a good list to pay attention to for the near future. Storm Drill R Gremlin SAP Hana If you are unfamiliar with some of software on the list, please read the article for more details.…