The Emergence of Data Science Teams and the Challenges of Building Industrial-Grade Data Products
Drew, a data scientist at Alluvium, recently joined me on the show to discuss his team's approach to building data science teams and the unique challenges of building data science products for industrial users. Drew shared that he believes that data science teams should cover all aspects of their work, from data to insights and actionable recommendations. He emphasized the importance of communication across these different functions, highlighting that this is just as crucial as technical expertise.
Drew also discussed his team's approach to recruitment, noting that it reflects the job itself as much as possible. This means that they look for individuals who not only have strong technical skills but also a passion for data science and a willingness to learn and adapt. He emphasized the importance of building a diverse team with a range of skill sets and experiences.
One of the key challenges that Drew's team faces is the sheer volume of data that industrial users generate. This is significantly higher than what Twitter generates on a daily basis, making it essential for data scientists to have the right tools and methodologies in place to handle this volume. Drew highlighted the importance of developing tools and techniques that can scale to meet these demands.
Drew also discussed the role of development tools and methodology in building data science teams. He emphasized the need for collaboration between different departments, including engineering, product management, and business stakeholders. This requires a deep understanding of both technical and non-technical aspects of the business, as well as a willingness to adapt to changing requirements.
In addition to the technical challenges, Drew also highlighted the importance of managing data-driven systems in production. This involves ensuring that models are properly instrumented and measured, and that they can be compared and contrasted to determine which ones are performing better. He noted that this requires a different set of skills than those required for building individual models.
The conversation also touched on the topic of DevOps and development ops for data science teams. Drew emphasized the need for clear communication between different stakeholders, including developers, product managers, and business leaders. This involves defining clear goals and metrics, as well as establishing processes for deploying and maintaining data-driven systems.
Finally, Drew discussed his vision for the future of data science teams. He believes that these teams will continue to evolve and adapt to changing requirements, requiring a deep understanding of both technical and non-technical aspects of the business. He emphasized the importance of collaboration, communication, and continuous learning in building successful data science teams.
Actionable Advice from Drew
If you're interested in building a career as a data scientist or joining an existing team, Drew offers some actionable advice. Firstly, he recommends starting to build your skills and experience by writing and talking publicly about what you do. This will help you build a network of contacts and gain confidence in communicating complex ideas.
Drew also emphasizes the importance of being willing to learn and adapt. As data science continues to evolve and adapt to changing requirements, it's essential to stay up-to-date with the latest tools, techniques, and methodologies.
For those just starting out, Drew recommends taking a baby step towards teaching others about what you do. This can be as simple as giving a presentation at a meet-up or writing a blog post about your experiences. By doing so, you'll gain a deeper understanding of your own skills and ideas, as well as build a network of contacts who can provide support and guidance.
Recruitment at Alluvium
Alluvium is currently hiring for data scientists at both the mid and senior level, as well as entry-level positions. The company also has opportunities available for back-end engineers, DevOps engineers, and product engineers. If you're interested in joining the team, be sure to check out their careers page at alluvium.io/careers.
Conclusion
Drew's conversation with me highlighted the unique challenges of building data science teams and products for industrial users. From handling massive volumes of data to managing complex systems in production, these teams require a deep understanding of both technical and non-technical aspects of the business. By embracing collaboration, communication, and continuous learning, data science teams can build successful products that drive real value for their organizations.
As we look to the future of data science, it's essential to stay adaptable and willing to learn. Whether you're just starting out or looking to make a career shift, there are many opportunities available in this exciting field. By following Drew's advice and staying up-to-date with the latest tools and methodologies, you can build a successful career as a data scientist.
Data Science for Social Good
In our next episode, we'll be joined by Mara Avrech, data nerd at large and tidy verse development advocate at Alluvium. We'll discuss the role of data science in social good, including civic tech and sports analytics. We'll also explore the role of data science paradigms like the tidy verse in the data science ecosystem as a whole.
This conversation promises to be informative and thought-provoking, and we can't wait to dive in with Mara. Make sure to tune in for our next episode!