by farhanhubble
1 subcomments
- I loved his Statistics for Hackers talk: https://speakerdeck.com/pycon2016/jake-vanderplas-statistics...
- These types of books are always interesting to me because they tackle so many different things. They cover a range of topics at a high level (data manipulation, visualization, machine learning) and each could have its own book. They balance teaching programming while introducing concepts (and sometimes theory).
In short I think it's hard to strike an appropriate balance between these but this seems to be a good intro level book.
- This book was absolute fire for getting started with data science in 2017-2018, Jake is a great teacher.
- Interesting choice of Pandas in this day and age. Maybe he’s after imparting general concepts that you could apply to any tabular data manipulator rather than selecting for the latest shiny tool.
by pantsforbirds
0 subcomment
- I used the Kernel Density Estimation (KDE) page/blog at my very first job. It was immensely useful and I've loved his work ever since.
- He's a great writer and I miss his blog. He had an awesome post on pivot table that I think is now a part of this book.
by refactor_master
1 subcomments
- I honestly don't get why you'd hate pandas more than anything else in the Python ecosystem. It's probably not the best tool in the world, and sure, like everybody else I'd rewrite the universe in Rust if I could start over, and had infinite time to catch up.
But the code base I work on has thousands and THOUSANDS of lines of Pandas churning through big data, and I can't remember the last time it lead to a bug or error in production.
We use pandas + static schema wrapper + type checker, so you'll have to get exotic to break things.
by synergy20
2 subcomments
- it's written 8 years ago though, there is a 2ed of the book by the same author.
by runningmike
0 subcomment
- https://learningds.org/intro.html
Cc-by-nc-nd
- This is one of the few books that I read cover-to-cover when I was starting out learning Data Science in 2020/21. Will recommend.
- I wouldn't say it's a handbook because it's more like an introduction. But it's pretty well written.
by badmonster
1 subcomments
- [flagged]
- Pandas is cancer. Please stop teaching it to people.
Everything it does can be done reasonable well with list comprehensions and objects that support type annotations and runtime type checking (if needed).
Pandas code is untestable, unreadable, hard to refactor and impossible to reuse.
Trillions of dollars are wasted every year by people having to rewrite pandas code.