We will be speaking at the Tuesday, September 3, 2019 BARUG. If you are in the Bay Area, please come see us.
Nina Zumel & John Mount
Practical Data Science with R
Practical Data Science with R (Zumel and Mount) was one of the first, and most widely-read books on the practice of doing Data Science using R. We have been working hard on an improved and revised 2nd edition of our book (coming out this Fall). The book reflects more experience with data science, teaching, and with R itself. We will talk about what direction we think the R community has been taking, how this affected the book, and what is new in the upcoming edition.
I am excited to announce
vtreat is now available for
Python on PyPi, in addition for
R on CRAN.
Continue reading vtreat up on PyPi
We, the community of Manning R and data science authors, have talked Manning into offering a catalog-wide 40% discount on all books. Please take a look at some great deals on some great technical books here: http://mng.bz/adRj !
We at Win-Vector LLC have some big news.
We are finally porting a streamlined version of our R vtreat variable preparation package to Python.
vtreat is a great system for preparing messy data for supervised machine learning.
The new implementation is based on Pandas, and we are experimenting with pushing the sklearn.pipeline.Pipeline APIs to their limit. In particular we have found the
.fit_transform() pattern is a great way to express building up a cross-frame to avoid nested model bias (in this case
.fit_transform() != .fit().transform()). There is a bit of difference in how object oriented APIs compose versus how functional APIs compose. We are making an effort to research how to make this an advantage, and not a liability.
The new repository is here. And we have a non-trivial worked classification example. Next up is multinomial classification. After that a few validation suites to prove the two vtreat systems work similarly. And then we have some exciting new capabilities.
The first application is going to be a shortening and streamlining of our current 4 day data science in Python course (while allowing more concrete examples!).
This also means data scientists who use both R and Python will have a few more tools that present similarly in each language.
Our publisher, Manning, is running a Memorial Day sale this weekend (May 24-27, 2019), with a new offer every day.
Fri: Half off all eBooks
Sat: Half off all MEAPs
Sun: Half off all pBooks and liveVideos
Mon: Half off everything
The discount code is:
Many great opportunities to get Practical Data Science with R 2nd Edition at a discount!!!
Also, Practical Data Science with R, 2nd Edition; Zumel, Mount; Manning 2019 is now content complete! It is deep into editing and soon into production!
A good friend is now a professor at the University of Auckland and knew to photograph and send us this. Thanks!!!
A good friend shared with us a great picture of Practical Data Science with R, 1st Edition hanging out in Cambridge at the MIT Press Bookstore.
This is as good an excuse as any to share a book update.
Continue reading Practical Data Science with R Book Update
Sunny day at the office, and new mocha cup!