A weekly newsletter with the latest developments in Data Science and Machine Learning and Artificial Intelligence.
Welcome to the January 23th issue of the Sunday Briefing.
We’re currently working on no less than 3 different blog posts that unfortunately we weren’t able to finish in time to go to press, but that we’re expecting to be able to share next week. Meanwhile, we have recently published the celebratory Christmas Tree Animation post on the Visualization for Science substack, while in the Graphs for Science substack the latest post is: Markov Chains and PageRank: Quantifying node importance where we explore PageRank one of the worlds most profitable algorithms.
We’re proud to announce a new date for our new series of webinars, A Data Driven Approach to Understanding COVID-19 with NetworkX on Feb 8th, 2022. Be sure to register so you don’t miss out! This coming Wednesday, Jan 26th we also have our regular webinar series on Causality Why and What If — Causal Analysis for Everyone. A great introduction to one of the most fascinating topics in modern statistics and data science.
On our regularly scheduled content we explore the rise of A.I. fighter pilots, cognitive load theory and its applications for learning and how to implement a probabilistic programming language in 70 lines of Python.
From the Ivory Tower, we explore forgetting as a form of adaptive engram cell plasticity, and two amazing new datasets: conspiracy mentality and political orientation across 26 countries and “I can’t keep it up anymore.” The Voat.co dataset. On the more theoretical side, we have a look at assortative Mixing in Weighted Directed Networks.
This weeks ‘Data Science Book’ highlight is Data Science Book is “Hands-On Data Visualization: Interactive Storytelling From Spreadsheets to Code” by J. Dougherty, I. Ilyankou. As always you can find all the previous book recommendations on our website. In the video of the week we have a lecture on Finding clusters in Graphs.
Data shows that the best way for a newsletter to grow is by word of mouth, so if you think one of your friends or colleagues would enjoy this newsletter, just go ahead and forward this email to them. This will help us spread the word!
The D4S Team
The latest post on the Graphs for Data Science substack: Markov Chains and PageRank: Quantifying node importance is now out.You should Sign Up to make sure you never miss a post!
The latest post in the CoVID-19 series, ‘How to model the effects of vaccination’ takes a look at how simple modifications of the SIR model can help us better understand how vaccines work. As usual, all the code is available in GitHub: http://github.com/DataForScience/Epidemiology101
The latest post in the Causality series covers section ‘3.7 — Mediation’, a recipe to calculate the controlled directed effect. The code for each blog post in this series is hosted by a dedicated GitHub repository: https://github.com/DataForScience/Causality
This weeks Data Science Book is “Hands-On Data Visualization: Interactive Storytelling From Spreadsheets to Code” by J. Dougherty, I. Ilyankou. Data visualization, and in particular, storytelling with Data is an often overlooked skill that can help you stand out among your peers. Being able to explain your results, highlight the key points and messages can help you maximize your professional impact. This book is one of the best introductions to Data Visualization and storytelling using a wide range of tools and techniques that are presented through hands-on, step by step, tutorials. The book focuses on generating interactive visualizations that can be embedded within your customer facing website or internal dashboard using a wide range of practical tools.
Tutorials and blog posts that came across our desk this week.
- The Rise of A.I. Fighter Pilots [newyorker.com]
- Cognitive Load Theory and its Applications for Learning [scotthyoung.com]
- Reversing an integer hash function [taxicat1.github.io]
- What is a Lagrange point, the final destination for the James Webb Space Telescope? [washingtonpost.com]
- A probabilistic programming language in 70 lines of Python [mrandri19.github.io]
- Strict Python function parameters [sethmlarson.dev]
- Abuse and harassment on the blockchain [blog.mollywhite.net]
Some of the most interesting academic papers published recently
- Forgetting as a form of adaptive engram cell plasticity (T. J. Ryan, P. W. Frankland)
- Conspiracy mentality and political orientation across 26 countries (R. Imhoff, F. Zimmer, O. Klein, J. H. C. António, M. Babinska, A. Bangerter, M. Bilewicz, N. Blanuša et al)
- Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents (R. Köster, D. Hadfield-Menell, R. Everett, L. Weidinger, G. K. Hadfield, J. Z. Leibo)
- A social perspective on perceived distances reveals deep community structure (K. S. Berenhaut, K. E. Moore, R. L. Melvin)
- Changes in COVID-19 Vaccine Hesitancy Among Black and White Individuals in the US (T. J. Padamsee, R. M. Bond, G. N. Dixon, S. R. Hovick, K. Na, E. C. Nisbet, D. T. Wegener, R. K. Garrett)
- Millions of Co-purchases and Reviews Reveal the Spread of Polarization and Lifestyle Politics across Online Markets (A. Ruch, A. Decter-Frain, R. Batra)
- ”I can’t keep it up anymore.” The Voat.co dataset (A. Mekacher, A. Papasavva)
- Assortative Mixing in Weighted Directed Networks (U. Pigorsch, M. Sabek)
Interesting discussions, ideas or tutorials that came across our desk.
Finding Clusters in Graphs
All the videos of the week are now available in our Youtube playlist.
Opportunities to learn from us:
- Jan 26, 2022 — Why and What If — Causal Analysis for Everyone [Register]
- Feb 03, 2022 — Data Visualization with matplotlib and seaborn for Everyone [Register]
- Feb 08, 2022 — A Data Driven Approach to Understanding COVID-19 with NetworkX [Register] 🆕
- Feb 25, 2022 — Deep Learning for Everyone [Register]
- Mar 04, 2022 — Time Series for Everyone [Register] 🆕
- Mar 07, 2022 — Advanced Time Series for Everyone [Register] 🆕
Long form tutorials:
- Natural Language Processing 5.5h, covering basic and advancing techniques using NLTK and Keras
- Times Series Analysis for Everyone 6h covering data pre-processing, visualization, ARIMA, ARCH and Deep Learning models
Thank you for subscribing to our weekly newsletter with a quick overview of the world of Data Science and Machine Learning. Please share with your contacts to help us grow!
Publishes on Sunday.