Ramnath designed rCharts, which combines the power of open-source R and D3.js. It gives very pretty plots. Here is some code, using the XML package to collect the data from http://www.sochi2014.com/fr/...
http://rformining.blogspot.com/2014/02/sochi-2014-r-d3js.html
I don’t exactly know where to start. But after a really pleasant discussion with one of my ex-colleagues, it seems that there are many things around the Hadoop ecosystem and R for analysts that should...
http://rformining.blogspot.com/2013/12/hadoop-for-rs-data-scientist.html
K-nearest neighbors. FUNDAMENTALS. The notion of a point's neighborhood is fairly intuitive. A simple definition would be: a region of space that contains this point. ...
http://rformining.blogspot.com/2013/11/les-k-plus-prcoches-voisins-vite-il.html
In my last post, I pointed to the Road to data science, imagined by Swami. I think this road is too long, and we can't tell the difference between the basics (which we have to know) and the advanced (it's ...
http://rformining.blogspot.com/2013/10/myown-way-to-data-science.html
Read away: an interesting post about the skills needed to become a Data Scientist. The post is about: Where to start? When do you start seeing light at the end of the tunnel? What is the learning roadm...
http://rformining.blogspot.com/2013/10/road-to-data-scientist-by-swami.html
BUILDING A SEARCH ENGINE. We will show how to build a simple recommendation engine using text-mining tools. This will be done in two ...
http://rformining.blogspot.com/2013/09/construire-un-moteur-de-reco-simple.html
WHAT I KNOW ABOUT TIME SERIES (1/5). Reading Flore Vasseur's preface to the essay “Le monde en 2030 vu par la CIA”, I was very struck by my lack of grounding in mac...
http://rformining.blogspot.com/2013/09/ce-que-je-sais-sur-les-series.html
INTRODUCTION. The naive Bayes classifier is one of the simplest supervised learning methods based on Bayes' theorem. It is little used by practitioners of ...
http://rformining.blogspot.com/2013/08/classifieur-naif-bayesien_5.html
Robert Matthews said: "Ronald Fisher gave scientists a mathematical machine for turning baloney into breakthroughs, and flukes into funding. It is time to pull the plug." He's right. In one ...
http://rformining.blogspot.com/2013/07/what-are-my-chances-to-talk-to-this.html
Suppose we have the IRIS units of Paris (with population >100k inhabitants) and want to classify them by their sociodemographic characteristics: population, unemployment rate, students, CSP e...
http://rformining.blogspot.com/2013/07/analyse-discriminante-lineaire-ou.html
ggmap is a new tool which enables such visualization by combining the spatial information of static maps from Google Maps, OpenStreetMap, Stamen Maps or CloudMade Maps with the layered grammar o...
http://rformining.blogspot.com/2013/07/ggmap-interesting-toolbox-for-spatial.html
Here, or there, I read about many techniques to import a large dataset into R. The read.table and read.csv options don't work anyway because, as discussed here, R loads the data into memory. And sometimes, whe...
http://rformining.blogspot.com/2013/06/how-to-read-large-dataset.html
Discussing with a non-statistician colleague, it seems that logistic regression is not intuitive. Some basic questions, like: - Why not use the linear model? - What's the logistic functi...
http://rformining.blogspot.com/2013/06/how-logistic-regression-work.html
After reading this post (thanks to him), I think it could be interesting to replicate this with some specifics of the French language and to see whether we can get a quick view of the debate between...
http://rformining.blogspot.com/2013/05/mining-last-french-presidential-debate.html
Quandl is a new database management tool which seeks to become the place to find datasets. That is, each unique indicator is considered an independent data set. This helps them to seem to have ...
http://rformining.blogspot.com/2013/05/a-new-package-quandl.html
A quick monitoring update from our media observatory on Twitter. AT MEDIAPART: / LE MONDE / LE FIGARO / LE PARISIEN / OVERALL VIEW. The code used to produce this post:
http://rformining.blogspot.com/2013/05/monitoring-des-medias.html
R DAY ON 24/05/2013 IN PARIS - MUSÉUM NATIONAL D'HISTOIRE NATURELLE. COME SHARE YOUR (LACK OF) KNOWLEDGE OF R! On the program: chemistry, automated reports, Gaussian mixtures, analysis ...
http://rformining.blogspot.com/2013/04/seminr-au-museum-dhistoire-naturelle.html
We will use the TWITTER and WORDCLOUD packages to show, based on Twitter, what France talked about this past week. In the near future, we will try to develop a tool for Mon...
http://rformining.blogspot.com/2013/04/quon-dit-les-medias-cette-semaine.html
In this post, we will illustrate a simple use of the TWITTER, STREAMR and TM packages, which make text mining possible. In fact, the first two are used to retrieve tweets and ...