Cada vez es más complicado seguir la actualidad del Big Data (en el enlace anterior teneis nuestra recopilación de lo más destacado publicado en el portal), por eso es muy interesante la recopilación de Data Science Central sobre el tema:
February 5, 2015
Code for learning the Structure of Graphical Models
PokitDok HealthGraph
Data Wrangling with dplyr and tidyr Cheat Sheet
Deep Learning in a Nutshell
Do we Need Hundreds of Classifiers to Solve Real World Classificati...- PDF document
Video: Advanced Machine Learning with scikit-learn
Predictive Modeling with R and the caret Package
Protovis: A Graphical Toolkit for Visualization
R Data: Data Analysis and Visualization Using R
How to Choose Between Learning Python or R First
Top 50 open source web crawlers for data mining
Year 2014 in Review as Seen by a Event Detection System
Optimization Algorithms in Machine Learning
Machine Learning course Video
Course from Rice University: An Introduction to Interactive Program...
MapReduce: Simplified Data Processing on Large Clusters
MapReduce Online
Distributed Hash Tables, Part I
One Page R: A Survival Guide to Data Science with R
Abridged List of Machine Learning Topics
Decision Tree Algorithms – Simplified
DataQuest - Browser-based learning for data science
January 23, 2015
How To Implement These 5 Powerful Probability Distributions In Python
Median Selection Subset Aggregation for Parallel Inference
The caret Package - Short for Classification And REgression Training
Bayesian Machine Learning on Apache Spark
How to Visualize Website Clickstream Data
Practical Data Science in Python
Starting data analysis/wrangling with R - Things I wish I'd been told
Sibyl: A System for Large Scale Machine Learning at Google - Video
Top 77 R posts for 2014
Implementing K-means Clustering to Classify Bank Customer
Data Animations With Python and MoviePy
A Young Person’s Guide to C# Bond
Video: Advanced Machine Learning with scikit-learn
pbdR: programming with big data in R
14 Best Python Pandas Features
Deep Learning in a Nutshell
Big Data for Predictive Machine Learning and Data Mining - Research paper, Cornell
R Markdown - About repoducibility of research experiments
January 6, 2015
Machine Learning Discussion Group - Deep Learning with Stanford AI Lab (Video 1 of 3)
Univariate Distribution Relationships - 76 probability distributions
Abridged List of Machine Learning Topics
Deep Learning in Neural Networks: An Overview
Using Word Clouds for Topic Modeling Results
The Split-Apply-Combine Strategy for Data Analysis
Open source dashboard templates
Configuring a Linux Virtual Machine for Data Science - Step-by-step guide, with Python, R and GIT
Do-it-yourself Crawlers vs. Crawlers as Service
Abridged List of Machine Learning Topics
Recommender Systems 101 – a step by step practical example in R
Programming tools: Adventures with R
Introductory R Presentation
What is a Bayesian Network?
December 24, 2014
Map-Reduce for Machine Learning on Multicore - PDF document
A Map-Reduce Algorithm for Matrix Multiplication
HAMA: An Efficient Matrix Computation with the Map-Reduce Framework - PDF document
Deep Neural Networks are Easily Fooled: High Confidence Predictions...
Video: An Overview of Deep Learning and Its Challenges for Technica...
Representation Learning: A Review and New Perspectives
5 Amazingly powerful Python libraries for data science
DIY Crawlers vs. Crawlers as Service
20 new data viz tools and resources of 2014
JavaScript data visualization for R
Controversies in the Foundations of Statistics - Research paper (1978)
December 10, 2014
An open source repository for responsive dashboard templates
Do we Need Hundreds of Classifiers to Solve Real World Classificati...- PDF document (MIT)
A Dozen Informative Videos on Data Science
An Introduction to Unsupervised Learning via Scikit Learn
30 data visualization tools
14 Best Python Pandas Features
What do practitioners need to know about regression?
10 Big data and analytics tutorials in 2014 - From IBM
Deep Neural Networks are Easily Fooled - PDF document
What is deep learning? - PDF document
Book: Statistics with R
Automatically making sense of data
December 3, 2014
Simple CSV Data Wrangling with Python
Best Practices for Hadoop. Learn the best practices for applying ad...
Data Science in the Statistics Curricula: Preparing Students to "Th...
R, an Integrated Statistical Programming Environment and GIS
Video: Introduction to Deep Learning with Python
Resources regarding the Julia programming language
Interpreting Confidence Intervals
The learning behind gmail priority inbox
November 17, 2014
Geoffrey Hinton on Deep Learning
Python Packages For Data Mining
Deep Learning Tutorial
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Tuning Machine Learning Models Using the Caret R Package
Getting Started with Deep Learning and Python
November 6, 2014
Hacker's guide to Neural Networks
2015: the Year of Big Data * - Warwick Data Science Institute
Exercise to compare classifier performance
10 Tips for Better Deep Learning Models
Running R in the Azure ML cloud
Getting Started with Deep Learning and Python
Foundations of Data Science by John Hopcroft & Ravindran Kannan
Videos: 20th ACM SIGKDD Conference on Knowledge Discovery and Data ...
October 24, 2014
Tutorial about Deep Belief Network in Python
Cheat sheets for developers
In-depth introduction to machine learning in 15 hours of expert videos
The Python Tutorial
Meta-list of data set repositories for cool data science projects
K Means Clustering - Effect of random seed
Demographic and lifestyle information by Zipcode - Interesting, but they don't provide education or age breakdown
Equitability, mutual information, and the maximal information coeff...(Research paper)
New approach to engineering analytics for deployment in streaming a...(Research paper)
50 Face Recognition APIs
Prediction intervals too narrow
October 16, 2014
Tutorial To Implement k-Nearest Neighbors in Python From Scratch
Exercise to detect Algorithmically Generated Domain Names
Deep Learning Tutorials
Popularity rankings: How to do it Right
How to Prepare Data For Machine Learning
swirl teaches you R programming and data science interactively
Data Science at the Command Line
Overfitting and Machine Learning
ADW, free software to measure semantic similarity
In-depth introduction to machine learning in 15 hours of expert videos
Hacker's guide to Neural Networks
Visualizing MNIST: An Exploration of Dimensionality Reduction