#92 Democratizing Data in Large Enterprises (with Meenal Iyer)

**Balancing Short-Term and Long-Term Wins: Managing Roadmaps with Data Scientists**

Managing roadmaps can be a challenging task, especially when it comes to balancing short-term wins with long-term goals. Data scientists, data engineers, and machine learning engineers are often at the forefront of creating and implementing these plans. In this article, we will explore how one expert, Minelle, manages her roadmap and balance between short-term and long-term wins.

**Minelle's Approach to Managing Roadmaps**

According to Minelle, she breaks down her roadmap into three main categories: 70% for strategic initiatives, 10% for ad hoc requests, and 20% for technical debt. She explains that this breakdown allows her to prioritize her work effectively and ensure that all aspects of the organization's technology strategy are addressed.

Minelle emphasizes that having a clear understanding of her team's strengths and weaknesses is crucial in managing roadmaps. "I have managed and we should be able to get all of these things done," she says, highlighting the importance of effective planning and resource allocation.

**The Power of Training Plans: Minelle's Experience with AWS Redshift**

One exciting project that Minelle worked on was migrating an organization's data to Amazon Web Services (AWS) Redshift. The project involved training a team of traditional data engineers, including herself, in cloud technologies. Minelle created a comprehensive training plan for her team, which took about six weeks to complete.

The outcome was impressive: the team successfully built out a full architecture, both batch and streaming, on the AWS Redshift platform, launching on time and with significant cost savings. Additionally, they were able to build a fraud detection model that could run in near real-time, resulting in a 92% accuracy rate.

Minelle is extremely proud of this project, which demonstrates the power of effective planning, training, and resource allocation. "I was so surprised at how my team just picked it up," she says, highlighting the importance of creating a supportive environment for learning and growth.

**Defining Data Democratization**

In her experience, data democratization goes beyond simply bringing data together into a central location. According to Minelle, it's about creating an interface that allows users to take ownership of their data and make informed decisions using data-driven insights.

"Data democratization is more than just solving data silos or centralizing data," Minelle explains. "It has to be an interface where the organization can say yes, we are moving in a data-driven fashion."

To achieve this, Minelle emphasizes the importance of providing users with the tools, access, and processes they need to succeed. This includes governance, privacy, and all the back-end processes that come with working with data.

**A Call to Action for Data Leaders**

As Minelle concludes her journey on the podcast, she offers a final call to action for data leaders: focus on creating an end-view for customers that puts them at the forefront of decision-making. "Tell your organization what that looks like," she advises. "Don't just focus on piping data in and bringing it all together."

By doing so, Minelle believes that data leaders can create a more empowered and self-sufficient organization, one that is well-equipped to harness the power of data to drive success.

**Conclusion**

In conclusion, managing roadmaps and balancing short-term wins with long-term goals requires effective planning, resource allocation, and training. Minelle's experience working on an AWS Redshift project highlights the importance of creating a supportive environment for learning and growth. By defining data democratization as more than just bringing data together, we can create an interface that empowers users to take ownership of their data.

As data leaders, it's up to us to create organizations that are driven by data-driven insights and empowered by self-sufficient teams. With the right approach, strategies, and training plans in place, we can harness the power of data to drive success and achieve our goals.

"WEBVTTKind: captionsLanguage: enyou're listening to data framed a podcast by data camp in this show you'll hear all the latest trends and insights in data science whether you're just getting started in your data career or you're a data leader looking to scale data driven decisions in your organization join us for in-depth discussions with data and analytics leaders at the forefront of the data revolution let's dive right in hi everyone this is adele data science educator and evangelist at datacamp if you've been listening to dataframe for a while now you probably know that data democratization is one of my favorite topics to discuss on the show every now and then i get to take a bird's-eye view with guests and discuss the broad ways organizations can become data-driven and this is one of these episodes today's guest is minal iyer minal is a data analytics strategist and transformational leader with over 22 years of experience building data analytics platforms and driving enterprises to be insights driven she has a wealth of knowledge and specialties including data literacy programs data monetization enterprise data analytics strategies amongst others she's led data teams at various retail organizations such as macy's tailored brands and more and she's one of the few data leaders i've spoken to that can really articulate a simple coherent strategy for data democratization throughout the episode we talk about the components of data democratization data culture and people the importance of standardizing business metrics to achieve data democratization which we spoke about at length how to enlist data champions as analytics leader and much more if you've enjoyed this episode make sure to rate subscribe and comment but only if you liked it also don't forget this week we'll be hosting datacamp radar our digital summit on june 23rd during the summit a variety of experts including minal from different backgrounds will be discussing everything related to the future of careers and data whether you're recruiting for their roles or looking to build a career in data there's definitely something for you seats are limited and registration is free so secure your spot today on events.datacam.com radar the link is in the description now on to today's episode minell it's great to have you on the show likewise adele thank you so much for the opportunity to speak in this show so i'm excited to deep dive with you on your work leading data science and tailored brands the importance of data democratization and how data leaders can really create a vibrant data culture but before can you give us a bit of a background about yourself absolutely so thanks again for letting me speak about data democratization it is one of my favorite topics to speak about so as terms of the background i've been in the space for about 20 over 20 years and have experienced like the good bad and ugly of the space worked across multiple industries which gave me also the experience to solve for unique data problems i enjoy or enabling data enterprises to essentially become data driven and you know that has kind of become my motto to provide the right information to the right people at the right time i'm currently working at tailored brands i had data science and experimentation here in addition to data and analytics that's really great so you mentioned here like data democratization is definitely the media today's conversation i want to set the stage for today's chat by first i'm backing what it means to democratize data you know we've seen a lot of organizations investing tons of resources and becoming data driven so i'd love to first start off the conversation to understand how you define data democratization if you ask me i think i found the best definition as something from what bernard marr basically said and what he said is and i quote uh data democratization means that everybody has access to data and there are no gatekeepers that create a bottleneck at the gateway to the data and the goal is to have anybody use data at any time to make decisions with no barriers to access or understanding and so if i have to kind of rephrase this a little bit i would just say data democratization is not just about data access but actually comfortable data access so you have a documented ready-to-use platform with tools and a culture that actually provides data-driven decision making and data literacy so in addition to just telling everyone that oh we have centralized all of our data access you actually need to make that access very very comfortable and convenient so end users can use it to become more self-service i love that concept of comfortable data access because i think a lot of people stop at just the data access component of it which really hurts the ability for people to make data-driven decisions at scale so as you mentioned there are definitely multiple components to democratizing data effectively this can range from scaling data access and data infrastructure centralizing data having like strong governed data as well culture and upskilling and a lot of different things that need to come together to be able to create a data-driven organization so i'd love it if you can walk us through the different ways organizations can accelerate their democratization absolutely i hope you have a pen and paper ready adele because it's a list of items so i would first say that if the organization is just starting with data democratization the first thing they need to have is an enterprise data strategy with a prioritized list of priorities as to what needs to be done in data itself the reason why this is so key and so critical is the fact that this is a multi-year journey and without the executive or enterprise support it is going to be almost impossible to make this mission a success if we do not have the strategy in place so this is very very key and very important so before anyone even embarks you have to know what is it that you're setting out to do and have basically your long-term vision set out once you have your data strategy set out you essentially start building out or thinking about a more forward-looking architecture so why we say forward-looking architecture is that you have to at least look out three to five years in terms of what your platform is going to be able to provide so you have your near-term priorities you have your strategic objectives for the organization where do you see your organizations in terms of data over the next three to five years and that's what your architecture is going to turn out to be so that you are not continuously you know doing migrations or not continuously trying to revisit this architecture and you are always in the process of building out and never at the point where the organization is fully self-sufficient with their data needs the third is so again we talk about where you try to reduce data silos within the organization so that's one key part of data democratization is to have access to the data vision within the organization itself now there may be some limitations to certain data which you may would not want to bring into a central warehouse or a central data lake and that is okay but for most part 85 to 90 percent of the organization's data essentially would bring would pipe into the system and so you need a centralized data team who basically helps with the piping and who helps essentially with managing the access across this the fourth is essentially data quality now this is very very key and very important to everything that you do you basically build data lineages and data quality as part of your process as you're piping data in so ensuring data quality will also give credibility in terms of what is being reported out of your platform and also when you get to the point where this goes to executors or when you start going into doing more data science or building data models and machine learning you know that your data is not skewed and or biased you can always trust the quality of the data so you trust the output of whatever is coming out of the data itself the fifth one and my most favorite is the semantic layer so the semantic layer is very very key just for the reason that that's the layer or the view that the end user has access to and this is the view that they use to essentially consume the data that exists within your ecosystem so when we talk about comfortable data access the semantic layer is the key to that comfortable data access because it tells your business users that okay you can basically query me without having to write very complicated sql or doing very complicated joins you have predefined metrics you have predefined kpis defined within that semantic layer so there is no reason for the organization to have confusion in terms of what is it that they're looking at or what is it that they're pulling out sixth is so once you have this platform in place you meet regularly with your stakeholders and you tell them okay and at every point in the journey actually you have to meet with your stakeholders and continue to update them on the status of what's going on and at this point what you should do is you have to be ready for some maybe reprioritization because these are organizational goals that you're trying to meet as part of the build out of this platform itself so you may expect some shift in priorities here or some additional priorities which you have to account for but the meeting with the stakeholders also ensures their continued support for this journey that you are on then you focus on basically the tools for the end users to use so these would be a standardized set of tools that essentially meet again the needs of almost 85 to 95 of your end user base there are going to be some end users who are going to have very specific use cases and you have to think about how you want to address those but for the bulk of the organization essentially you have to think about what are the standard tools that we have to provide to them so that they are able to get the most value out of this platform and then you have data stewards essentially identified for all of the data that is coming into this platform for each source of data that you're bringing into this platform you have data keywords identified so subject matter experts who essentially are able to help define or dictate as to what the rules of the data going from the system essentially coming is now these people also become your data stewards for all your governance processes and you know privacy processes and everything as well then we talk about data privacy specifically to support ccpa and gdpr for example customer data is very sensitive you have hepa rules in terms of health care data all of this is very very key so the data stewards essentially you work very closely with them to identify what's sensitive versus what's not sensitive and who can get access to what and then you work on data literacy your platform is set up and you basically want to improve and increase the adoption of it across the organization itself so data literacy programs where you provide continuous training that where it ensures that adoption and it ensures the continued usage of the platform so that it continues to stay effective and you have to continuously evaluate and run statistics on the platform and understand if there are certain areas of your platform that are not being used so you have added functionality but then no one is actually using it and try to understand that how did this become a priority and not get used that is an exercise as a leader that you have to continually keep doing and if you do these and this is like a small list of components essentially that lead to data democratization then you know you are i wouldn't say guaranteed but you are bound to move towards a successful implementation i love the clarity by which you approach the discussion here and how you position each element so concisely now of course out of all of these levers there's so many things that we can unpack from governance infrastructure upskilling out of these different levels what do you think is the most challenging an important leverage skill and why is that why is that the case so i think the most important one from my standpoint and what has been the most challenging is the data culture and that comes to user adoption and literacy itself it's human nature essentially you know you're very you're very comfortable with what you work with and you're comfortable with what you have access to it provides you so much security that you are the one who kind of knows this stuff and all of a sudden you come up with something new which is going to basically automate and then you're like oh my god now 70 of my time that i was spending on doing my stuff has now been reduced to 5 what do i do and that's i think the biggest challenge that organizations face and it's more so with legacy organizations organizations where you know which have been in place much before you know technology came into play so affecting data culture is from my perspective the most challenging and the most important level to scale that adoption is very very key if you truly want to enable your enterprise to be data driven so we have to make sure that the organization conforms to the guiding principles of the platform regarding governance privacy the consistency in key metrics that are being reported and we probably we can talk about enterprise key enterprise metrics being consistent across the organization at a later point but you know all of those are very very key in them having to accept it just so that you can truly create a culture of where the decisions that are made by data again are not skewed and or biased and coming from a place where the data cannot actually be trusted so i i think changing the culture from my perspective has to be the most challenging definitely and it's all about change management and giving the ability of people to kind of dig for data for themselves and creating that mindset shift as a data leader how has a lack of data culture affected the adoption of some of the solutions that you've developed even outside of creating a platform where people can do data for themselves but as a solution for the data science solutions that you adopt and what are the ways that you've been able to battle through such adoption issues so the biggest barrier that comes from a lack of data culture is essentially the proliferation of data silos within the organization you have data silos you have reporting silos you have organizations where you have 15 20 30 reporting tools within the organization you have don't even ask how many data architectures sitting over there data is not in a singular place reporting is all over the place and numbers are never matching and then organizations struggle to say okay which is the number that i should look at i'm setting a goal to basically say my sales has to increase by 10 next year but which sales should i look at when you look at it if there is a lack of that data culture or the lack of where you're not looking at a singular governed data set then all of a sudden these kind of problems become more rampant and they exist within organizations and it's not uncommon to have so you have increased total cost of ownership because now you have so many of these self managed data and data sets and then no ability to control so if you have sensitive data that moves across the organization you have no ability to manage or control that and if with things like ccpa that comes in it becomes so important that very sensitive data essentially has to be managed so closely so that if a customer comes and says i need to remove access to this data then you'll have to remove that data out of the system then it's not easy to do if it's all over the place and you have no way of identifying where it exists so i i think that lack of data culture does cause a lot of problems some of them are tangible some of them are not and some of them are visible and some of them are not but it does it causes a lot of issues how do we make data essentially a priority in every conversation with every initiative that takes place we start talking about what are the success metrics for that initiative and how are we going to measure it because you cannot manage what you can't measure and you have to be able to measure that so how do we bring data into that conversation now it may translate to saying that we truly don't have data needs but it is important for the data person to data leader to essentially have a seat at the table so that what that will allow is that okay now you have a whole list of stakeholders who are sitting with you and telling okay this is the initiative that is coming up and then it allows for you to have the conversation and say okay this is what it is going to touch this is what it's going to impact then for them to have a top down conversation where they cascade that information down to their to their leaders and then it goes down all the way to the bottom of the of the enterprise itself now the other thing you have to look at is also bottom-up so the top-down approach sometimes it's not very effective just for the reason that it's represented in a very different way so to ensure that that message is reached you have to ensure that there is also communication from up you show value essentially from what is coming out of your platform what is going to get impacted and see that how you can essentially provide that value for their business unit and our team also and so that needs to be something that you have to be continuously communicating to them so that's one of the ways in which you can ensure that that lack of data culture does not exist within the organization it of course happens in a piecemeal way in some organizations it's easy because they already know what they want in terms of data centralization or it exists it's just that they want to truly democratize data so depending on where you are within the organization there are there's that communication that you have to do either from top down or bottom up and that will help address the issues that basically come with that lack of data culture that the communication is very key that's really great and harping on the semantic layer that we discussed about and how it also empowers the data culture i've seen you speak about this quite a bit and we've mentioned this in our conversation so far is the importance of standardizing business metrics to be able to galvanize a data culture do you mind further expanding into that and how it helps accelerate the data culture within organizations absolutely so let me give an example in one of the organizations that i was working with there was a key metric that was in use within the organization and we had 13 different definitions of that metric 13 not one or two we had 13 different definitions of that organization so you can imagine just what happened that when that metric actually got reported across and that was a metric that was being used to compute to basically as part of our strategy to see that whether we were making progress in our initiative or not so it was very very important that that metric definition become consistent so that we know exactly what number we were starting from and what we were actually driving towards i talked about this so much because this again is something that is rampant across organizations finance for example has their own definition marketing has their own definition sales has their own definition and then so do other business units having their own definitions of each metric so bringing all those stakeholders together and trying to basically understand as to okay we have all of these definitions but as an organization we all need to kind of attach to one or one definition that we can come so that when we are actually communicating it is we are all speaking as one rather than speaking at six or seven different business units and not speaking six or seven different languages so inconsistency in metrics essentially means where the metric is defined differently across different business units and is being used in different forms but it is called the same thing and that leads to a lot of chaos and it's very very important that organizations essentially come to us an alignment in terms of how they truly want to define it that's really great and definitely a nightmare scenario when you have 13 different definitions of the same metric so if you're an organization or data leader looking at such a situation where there is a lot of messy definitions of metrics how do you approach the journey of reaching consistency what does it entail and what does that journey look like so what we do here is so first you start identifying what are the key metrics across your organization and we prioritize them think of it almost like a data journey for you i wouldn't say it's a multi-year journey but it is a journey to essentially get all of this aligned so you need to first understand what are the key metrics that are in use across the organization and specifically focus on metrics that are used across business units so if a single business unit has a metric that they use that's not of too much concern just for the reason that it's used just within that business unit but if there are metrics that are used across business units that's where the concern starts and so you identify basically the list of those metrics you sit in down with the stakeholders and begin a conversation now this is like the toughest part i don't think the build out of the metric or anything is difficult but it is this conversation where you have to get alignment in terms of metrics and the challenge with these conversations typically have been is who is going to take ownership of this metric going forward so you land on a definition but then who becomes that owner who becomes responsible for that metric so this conversation is very very important and very very key to basically have and you bring all of them together and you sit in and you tell them okay we have all of these so let's come now up to an alignment now in this case the data leader essentially is the facilitator or the coordinator of these conversations but you essentially are waiting for the business to make that decision in terms of what is the definition that should be used forward so you document and you take it across to the stakeholders and say that this is what we have now can you tell us that which is the one that should be used once they decide on that singular metric we say okay now who becomes the owner of that metric going forward so defining that data steward or the owner of that metric that person will be responsible now for all communication for that metric going forward so business changes the way we look at the way we pivot our business or the way we look at our customers our product changes for example and the metric definition is likely to change so this person then becomes responsible to say you know what just because of these changes that are happening going forward this is what the metric definition is going to look like and that communication needs to be done they need to get alignment from all the other stakeholders and say okay this is what is this is what it is going to be forward going forward this person is also going to be responsible for that definition in a business glossary so you have a business glossary within the organization so in some cases it's a fancy business glossary like it comes as part of your data governance tool but it can be something as simple as a shared excel document or a shared sharepoint site or a shared confluence document or something where you know that is all defined the definitions are put over there for the organization to basically see so in short basically this person owns the metric end to end and then we just go back repeat this exercise until we cover all the kpis and metrics that are in use within the organization so as i said again it's a journey i wouldn't say necessarily it's multi-year based on the size of the organization and based on the number of metrics that you have but till you identify and get to that point you just go through till you cover all the metrics once you have all the definitions in place then the data leader essentially goes back to his or her team and then goes and has that metric and in its definition put as a metric within the semantic layer so again this is why the cement clear is very very important so if i have something called financial then financial sales becomes a metric within my semantic layout so whoever accesses it going forward will get the exact same value and would you consider this like a massive low hanging fruit that can really accelerate data culture given that it's not a multi-year journey but something that just requires alignment people sitting in a room and this is one of the easiest way a data leader can make an impact in an organization oh my god yes yes i think this is so key and so important and people fail to see that you're absolutely right i like that i like the low hanging fruit yes this is something that can be so so easily achieved and can be done with just some very quick communication between teams and someone just taking ownership of something so simple something which they are already working with and they just take ownership of it so absolutely i think this is something that is very key to bringing at least landing and bringing that data culture much much closer to what you are looking for that's really great and i think this marks a really great segue to discuss kind of the role of the data leader and democratizing data i think the past few years we've definitely seen the role of the data leader whether that's a vp of analytics or data science a cdo a chief data analytics officer it evolved into much more of a culture sword and a change management steward rather than someone who just sets the data agenda of the data team so how do you have how have you seen this evolution over the past few years and what do you think of the data leaders role in democratizing data today so i think that role has evolved in leaps and bounds so if you looked at traditional data leadership essentially the data leader was an order taker so his or her responsibility was just pipe the data into the system and make it available to the end user there was nothing beyond that you attached a reporting tool on top of it and that's about it they were an afterthought typically when initiatives launched or projects launched but over the past few years when people started saying oh my god data is the new oil data is your latest asset and then everything is all data data data and people are like going all completely crazy this role has now completely like 365 and now what has happened is that the data leader has a seat at the table and their responsibility now is just not building those data pipes but now they are actively responsible for you know changing the data culture within the organization making sure that organization is data literate that data is being used very effectively across the organization in words it's basically they're like they become an evangelizer of data and they have to become that change agent which they originally like traditionally were not so now in with basically the just having like a role of where all you had to do was like pure data engineering now you have become like a data strategist your data transformist you have to think about how you're going to monetize data you have to think about you have to pretty much think about everything from a data perspective you have to see that and ensure that the value that the business thinks that they are going to get from the data you have to prove and provide that value to the organization so the role has completely shifted where the onus of all of this is now falling directly on the data leader so whoever is the data leader and who's sitting in the space has a great responsibility and i'm not saying that they were not responsible before it's just that a heavy responsibility has now been placed on them where they have to be so visionary so forward-looking in addition to just being someone who can execute and what do you find our key guiding principles to succeed as a data leader in such a stressful as well as high pressure environment where you have to really decide on and drive the data strategy of the entirety of the organization if you think about guiding principles i think one is communication is a big big big key right so when the leader gets into this role he or she may or may not have access to what they need to do their task effectively they have to go out reach out and ensure that they fully understand what is it that is the goal if it's not something that they understand at this point and what you have to do also is that you have to have as i said a strategy in place that strategy would kind of help you dictate as to what your next steps are going to be what you are actually going to do so that strategy is very important not only like a data strategy but you also have a forward-looking technology strategy for your team and for your organization itself so communication getting that executive support having a strategy in place essentially a lot of what we actually talked about in terms of the components of data democratization become also your principles for what you need to do to basically make your life much much easier and communication holds a big place because you not only have to communicate to your stakeholders continuously providing that value that they are looking for but also to the folks who are your end users and continually telling them in training them and telling them about the value of the data as well so that they adopt your platform so i would think in terms of guiding principles i would probably go back again to my components of data democratization and say that okay these are this is almost like a checklist for me and let me ensure that i'm going and performing each of these steps this should help reduce that stress that data leaders now face and then of course you keep yourself very engaged with the industry attend there are a lot of data and analytics conferences where thought leaders come they share their best practices they share their thoughts and challenges in terms of what's going on where they are challenged and hearing and listening to other people going through the same journey essentially helps you you know you have buddies who are who essentially are going through the same journey as you and you learn a lot essentially coming out of those i can imagine and you mentioned here communication being super key of course the data leader cannot do it all by themselves how do you choose your collaborate collaborators when embarking on these large transformational projects and what does successful collaboration look like in a con context of these data transformation initiatives so i see collaboration happening in multiple forms so again there is this whole piece of where you have a data strategy and you have priorities for your data itself so in that case it becomes a little easier because your collaborators are kind of already defined and you already know that they have strategic initiatives which tie to organizational goals and they become your partners very easily but if you are also building out and you are also looking to see that how can you how can you bring more business into your platform you basically reach out to a business unit so again that's where communication becomes very important you almost have to be i wouldn't say like a know-it-all but you have to be someone who's aware of things that are going on within the organization itself you reach out to business units and say you know what i've heard actually that you'll have this big initiative that you're embarking on and data may be a concern for you is there any way essentially that we can step in and help you out in some cases you are able to get this project funded but in some cases it's almost like you have to prove it as a proof of concept and once you prove it then you have a collaborator for life and that person also becomes your data champion and allows you to promote it so that's one kind of collaboration that you can do the second kind of collaboration that you do is going and talking to the top books you can go down to the bottom folks and you talk to them and sit down with them understand what their challenges are and you do a similar exercise in terms of how do we how can i help you there is opportunity here i see and how can i make this easy for you go to them reach out again it may be funded may not be funded again you do proof of value and again you've been a data champion for life or you've been a collaborator for life i i have done this successfully across my organization and i have seen that one is not only that these data champions become my it's like the train the trainer kind of mode they become my champions for across the organization so there have been meetings where they sit in and i have not sat in and they speak to the capabilities of my platform much better than i ever could and so that becomes an automatic showcase for my platform without me having to say anything so as i said you find collaborators in many forms you just have to be aware of what's going on from a data standpoint around your organization and you go reach out and you just have to go with the attitude that's possible that you're just going to probably get pushed out or you may not get funding but if there is value you know that has to be unlocked because sometimes people don't see the value that has to be unlocked but if you see that value to be unlocked then you almost you force take that thing and you say okay you know what let me prove it to you there's no cost to you i do it for you and you tell me whether this is not going to change your life and almost like 90 to 95 of the time it has changed i think that sees a lot of success and for me if you ask me that's the best way in which data championing or collaborations with others have democratized my data better within the organization that's really great and i love this virtuous cycle model that you're proposing where you start off with the low-hanging fruit it creates more evangelism of the data generates a lot more opportunities to do a lot more low-hanging fruits and it's like a cross-pollination of data within the organization do you mind sharing to a certain extent successful low-hanging fruits projects that you've worked on that were able to create data champions and how do you prioritize which projects to go after first in your data journey to be able to create this momentum and keep it going yeah i have a couple of use cases that i could actually share so in this one organization of ours we had this whole team of marketing analysts essentially who were there and one of those analysts essentially was spending 70 of his week on doing this repetitive task okay so he was given an exercise and spent 70 percent of his week doing all of this manually then again he repeats that same thing 70 percent of the week because the parameters have changed and or shifted and it's a challenge so you have this really intelligent individual who is here and that person is doing a manual task spending 70 on his time on probably something that he was not actually hired for originally but there is so much more value he can actually provide to the organization so we reached out first to his manager and then eventually to him and we said okay we can help support you and we can actually automate this whole function out for you and he looked a little worried and he's like there's no way this is so complex and everything we said why don't you just give us a chance and we can show basically what you can actually do so it took us a good two and a half to three weeks to basically get everything that he was doing manually automated and then not only did we like automate what he had to do but we also gave him the capability that tomorrow if he has to change or shift the parameters or variables within that that set he can do that with just the click of a button he saved not only that 70 percent of the time but just because he had all this additional time he got to work on more fun stuff and he was so happy at the end of all of this that he became like our champion for life he became our key mouthpiece in all meetings he would just keep saying oh if there's anything that you'll want to get done this is the team who's going to do it for you it my team's work was done because we were like okay we don't have to go and market ourselves anymore we don't have to prove value anymore we don't have to showcase anymore we have someone who is going to do this automatically for us it's so simple right as you very nicely said low hanging fruit these are like low hanging fruit that you just have to go figure out again you know they exist and go take your opportunity and go after it in another organization i was working there was a very critical problem of where they were doing this computation of on-hand inventory manually and it was at a very very very granular level so there was so much so much data to crunch that and again since they were doing it manually they were able to manage it only twice a year it's a very critical thing that needs to be done actually at a weekly level but one given just the size and nature of the data and the fact that they had to do it manually just allowed them to do it twice a year they could not do beyond that even that twice a year was like a nightmare for them so again we intervened we got into play and they were a very very very skeptical team very skeptical and again we said free of charge we'll just go and prove it to you and then if it works all you do is basically have this thing run on our system going forward and we went we built it in into our new platform so not only then did we save on resource efficiencies now basically the process even ran weekly as it was originally this basically desired so that was what the actual intent of that work was and again freed up a lot of time and then they could actually focus on other things that were sitting on their plate which they were not able to focus on because of this whole nightmarish thing they were doing so that cheta champion again became so happy that he worked with us to eliminate that data silo and the reporting tools his team was using so he said you know what we don't need this your platform is the one that we want to start building everything on so when it came to migrating everything from his tools and his platform on he became that champion for his team he went and spoke to his leadership and said that this is what we need to do and we need to move out of this so again easy low hanging fruit something we can all get to and something easy to do as a as a leader as a data leader as i said you have to be like a communicator and an evangelist and as long as you are able to get to the root of the problem and then figure a way to help fix it for them you are going to be able to successfully ensure that your organization is using data in the right way and they adopt your platform which essentially was just built for that purpose that's really great and i love these stories especially when you showcase the enthusiasm that it created now as a data leader you want to balance out between the low-hanging fruit and the strategic long-term projects and even the more technically demanding data science projects that require machine learning highly complex algorithms and tools the only subject matter experts like data scientists data engineers machine learning engineers know how to master how do you balance out that roadmap between short-term and long-term wins and what are some of the exciting kind of long-term projects that you've worked on as well awesome so this is how i manage and this is specifically for me how i manage so the way i resource and the way i think about it is 70 i work on strategic initiatives 10 is essentially ad hoc where my low hanging fruit falls and any other ad hoc needs and or requests that just pop in and then 20 is technical debt okay so we do accumulate technical debt and stuff starts getting older and then how do we keep it always new and fresh so that's how i do the breakdown now in some cases what happens is that just because of timelines one becomes ninety percent and then the others just become a little lower but overall as an at an average i maintain it like this 70 10 20 um percentages so just a little it just allows me to say okay i have managed and we should be able to get all of these things done and it for me it has worked basically pretty effectively now in terms of use cases that so there are a lot of use cases that i exciting ones that i've actually worked on but one that i was very extremely proud of not to say i was very excited about it but i was extremely very proud of was when in one of the organizations i worked with where we migrated to the aws cloud redshift had basically just come out i think it was about eight months old so we were almost going to be close to they probably were all having uh beta customers and not production customers and we were going to be one of the first ones going into like going into the aws redshift and the interesting thing was that my team had basically traditional data engineers including myself so we hadn't had access to cloud technologies and he hadn't done that and i created a training plan with one and a half months all of my team became like cloud i wouldn't say experts but we became we knew everything there was to know about the aws cloud and we probably could talk about that in a separate session in terms of how we did that but i was so surprised at how my team just they just all picked it up and in six weeks we were all ready we built out the whole architecture our whole architecture okay and this was not only the batch architecture we built a full streaming architecture in the cloud and we launched on time of course there was a lot of money savings because we were migrating out of a very expensive platform as well and then of course aligning with the broader technology strategy for the organization and then in addition what we were able to do is that we were able to build a fraud detection model on this platform and which would actually run in near real time and then spit out essentially back to the application it would spit out a score to say that whether this transactional was going to be fraudulent or not and that saved so much of the resources on the risk management team because otherwise risk was going through these things actually manually so our ability to basically be able to score the transactions and we had about a 92 percent accuracy i made it all worthwhile so a lot of exciting use cases to be worked on but this one i'm very particularly proud about that is so nice and as we end on a such an inspirational note talking about kind of the the value of creating a training plan do you have any final call to action me now before we wrap up i'll probably repeat what i said when you asked me to define data democratization so again data democratization is more than just bringing data all together into a central location so it's not only about solving data silos or centralizing data right it has to be it has to be truly an interface that the user can become self so it has to be an interface where the organization truly can say yes we are moving in a data driven fashion it has to be an interface in which you provide comfort you provide the tools you provide the access and then you provide all the processes the back-end processes of governance and privacy and all of that and help basically the organization succeed and becoming a data-driven environment so i think for me in terms of call to action i tell all data leaders is that focus on what that end view for the customer should look like rather than just focusing on piping data in and just bringing all that data into a central location thank you so much minelle for coming on the podcast i really appreciate it thank you so much adele thank you for letting me speak you've been listening to data framed a podcast by data camp keep connected with us by subscribing to the show in your favorite podcast player please give us a rating leave a comment and share episodes you love that helps us keep delivering insights into all things data thanks for listening until next timeyou're listening to data framed a podcast by data camp in this show you'll hear all the latest trends and insights in data science whether you're just getting started in your data career or you're a data leader looking to scale data driven decisions in your organization join us for in-depth discussions with data and analytics leaders at the forefront of the data revolution let's dive right in hi everyone this is adele data science educator and evangelist at datacamp if you've been listening to dataframe for a while now you probably know that data democratization is one of my favorite topics to discuss on the show every now and then i get to take a bird's-eye view with guests and discuss the broad ways organizations can become data-driven and this is one of these episodes today's guest is minal iyer minal is a data analytics strategist and transformational leader with over 22 years of experience building data analytics platforms and driving enterprises to be insights driven she has a wealth of knowledge and specialties including data literacy programs data monetization enterprise data analytics strategies amongst others she's led data teams at various retail organizations such as macy's tailored brands and more and she's one of the few data leaders i've spoken to that can really articulate a simple coherent strategy for data democratization throughout the episode we talk about the components of data democratization data culture and people the importance of standardizing business metrics to achieve data democratization which we spoke about at length how to enlist data champions as analytics leader and much more if you've enjoyed this episode make sure to rate subscribe and comment but only if you liked it also don't forget this week we'll be hosting datacamp radar our digital summit on june 23rd during the summit a variety of experts including minal from different backgrounds will be discussing everything related to the future of careers and data whether you're recruiting for their roles or looking to build a career in data there's definitely something for you seats are limited and registration is free so secure your spot today on events.datacam.com radar the link is in the description now on to today's episode minell it's great to have you on the show likewise adele thank you so much for the opportunity to speak in this show so i'm excited to deep dive with you on your work leading data science and tailored brands the importance of data democratization and how data leaders can really create a vibrant data culture but before can you give us a bit of a background about yourself absolutely so thanks again for letting me speak about data democratization it is one of my favorite topics to speak about so as terms of the background i've been in the space for about 20 over 20 years and have experienced like the good bad and ugly of the space worked across multiple industries which gave me also the experience to solve for unique data problems i enjoy or enabling data enterprises to essentially become data driven and you know that has kind of become my motto to provide the right information to the right people at the right time i'm currently working at tailored brands i had data science and experimentation here in addition to data and analytics that's really great so you mentioned here like data democratization is definitely the media today's conversation i want to set the stage for today's chat by first i'm backing what it means to democratize data you know we've seen a lot of organizations investing tons of resources and becoming data driven so i'd love to first start off the conversation to understand how you define data democratization if you ask me i think i found the best definition as something from what bernard marr basically said and what he said is and i quote uh data democratization means that everybody has access to data and there are no gatekeepers that create a bottleneck at the gateway to the data and the goal is to have anybody use data at any time to make decisions with no barriers to access or understanding and so if i have to kind of rephrase this a little bit i would just say data democratization is not just about data access but actually comfortable data access so you have a documented ready-to-use platform with tools and a culture that actually provides data-driven decision making and data literacy so in addition to just telling everyone that oh we have centralized all of our data access you actually need to make that access very very comfortable and convenient so end users can use it to become more self-service i love that concept of comfortable data access because i think a lot of people stop at just the data access component of it which really hurts the ability for people to make data-driven decisions at scale so as you mentioned there are definitely multiple components to democratizing data effectively this can range from scaling data access and data infrastructure centralizing data having like strong governed data as well culture and upskilling and a lot of different things that need to come together to be able to create a data-driven organization so i'd love it if you can walk us through the different ways organizations can accelerate their democratization absolutely i hope you have a pen and paper ready adele because it's a list of items so i would first say that if the organization is just starting with data democratization the first thing they need to have is an enterprise data strategy with a prioritized list of priorities as to what needs to be done in data itself the reason why this is so key and so critical is the fact that this is a multi-year journey and without the executive or enterprise support it is going to be almost impossible to make this mission a success if we do not have the strategy in place so this is very very key and very important so before anyone even embarks you have to know what is it that you're setting out to do and have basically your long-term vision set out once you have your data strategy set out you essentially start building out or thinking about a more forward-looking architecture so why we say forward-looking architecture is that you have to at least look out three to five years in terms of what your platform is going to be able to provide so you have your near-term priorities you have your strategic objectives for the organization where do you see your organizations in terms of data over the next three to five years and that's what your architecture is going to turn out to be so that you are not continuously you know doing migrations or not continuously trying to revisit this architecture and you are always in the process of building out and never at the point where the organization is fully self-sufficient with their data needs the third is so again we talk about where you try to reduce data silos within the organization so that's one key part of data democratization is to have access to the data vision within the organization itself now there may be some limitations to certain data which you may would not want to bring into a central warehouse or a central data lake and that is okay but for most part 85 to 90 percent of the organization's data essentially would bring would pipe into the system and so you need a centralized data team who basically helps with the piping and who helps essentially with managing the access across this the fourth is essentially data quality now this is very very key and very important to everything that you do you basically build data lineages and data quality as part of your process as you're piping data in so ensuring data quality will also give credibility in terms of what is being reported out of your platform and also when you get to the point where this goes to executors or when you start going into doing more data science or building data models and machine learning you know that your data is not skewed and or biased you can always trust the quality of the data so you trust the output of whatever is coming out of the data itself the fifth one and my most favorite is the semantic layer so the semantic layer is very very key just for the reason that that's the layer or the view that the end user has access to and this is the view that they use to essentially consume the data that exists within your ecosystem so when we talk about comfortable data access the semantic layer is the key to that comfortable data access because it tells your business users that okay you can basically query me without having to write very complicated sql or doing very complicated joins you have predefined metrics you have predefined kpis defined within that semantic layer so there is no reason for the organization to have confusion in terms of what is it that they're looking at or what is it that they're pulling out sixth is so once you have this platform in place you meet regularly with your stakeholders and you tell them okay and at every point in the journey actually you have to meet with your stakeholders and continue to update them on the status of what's going on and at this point what you should do is you have to be ready for some maybe reprioritization because these are organizational goals that you're trying to meet as part of the build out of this platform itself so you may expect some shift in priorities here or some additional priorities which you have to account for but the meeting with the stakeholders also ensures their continued support for this journey that you are on then you focus on basically the tools for the end users to use so these would be a standardized set of tools that essentially meet again the needs of almost 85 to 95 of your end user base there are going to be some end users who are going to have very specific use cases and you have to think about how you want to address those but for the bulk of the organization essentially you have to think about what are the standard tools that we have to provide to them so that they are able to get the most value out of this platform and then you have data stewards essentially identified for all of the data that is coming into this platform for each source of data that you're bringing into this platform you have data keywords identified so subject matter experts who essentially are able to help define or dictate as to what the rules of the data going from the system essentially coming is now these people also become your data stewards for all your governance processes and you know privacy processes and everything as well then we talk about data privacy specifically to support ccpa and gdpr for example customer data is very sensitive you have hepa rules in terms of health care data all of this is very very key so the data stewards essentially you work very closely with them to identify what's sensitive versus what's not sensitive and who can get access to what and then you work on data literacy your platform is set up and you basically want to improve and increase the adoption of it across the organization itself so data literacy programs where you provide continuous training that where it ensures that adoption and it ensures the continued usage of the platform so that it continues to stay effective and you have to continuously evaluate and run statistics on the platform and understand if there are certain areas of your platform that are not being used so you have added functionality but then no one is actually using it and try to understand that how did this become a priority and not get used that is an exercise as a leader that you have to continually keep doing and if you do these and this is like a small list of components essentially that lead to data democratization then you know you are i wouldn't say guaranteed but you are bound to move towards a successful implementation i love the clarity by which you approach the discussion here and how you position each element so concisely now of course out of all of these levers there's so many things that we can unpack from governance infrastructure upskilling out of these different levels what do you think is the most challenging an important leverage skill and why is that why is that the case so i think the most important one from my standpoint and what has been the most challenging is the data culture and that comes to user adoption and literacy itself it's human nature essentially you know you're very you're very comfortable with what you work with and you're comfortable with what you have access to it provides you so much security that you are the one who kind of knows this stuff and all of a sudden you come up with something new which is going to basically automate and then you're like oh my god now 70 of my time that i was spending on doing my stuff has now been reduced to 5 what do i do and that's i think the biggest challenge that organizations face and it's more so with legacy organizations organizations where you know which have been in place much before you know technology came into play so affecting data culture is from my perspective the most challenging and the most important level to scale that adoption is very very key if you truly want to enable your enterprise to be data driven so we have to make sure that the organization conforms to the guiding principles of the platform regarding governance privacy the consistency in key metrics that are being reported and we probably we can talk about enterprise key enterprise metrics being consistent across the organization at a later point but you know all of those are very very key in them having to accept it just so that you can truly create a culture of where the decisions that are made by data again are not skewed and or biased and coming from a place where the data cannot actually be trusted so i i think changing the culture from my perspective has to be the most challenging definitely and it's all about change management and giving the ability of people to kind of dig for data for themselves and creating that mindset shift as a data leader how has a lack of data culture affected the adoption of some of the solutions that you've developed even outside of creating a platform where people can do data for themselves but as a solution for the data science solutions that you adopt and what are the ways that you've been able to battle through such adoption issues so the biggest barrier that comes from a lack of data culture is essentially the proliferation of data silos within the organization you have data silos you have reporting silos you have organizations where you have 15 20 30 reporting tools within the organization you have don't even ask how many data architectures sitting over there data is not in a singular place reporting is all over the place and numbers are never matching and then organizations struggle to say okay which is the number that i should look at i'm setting a goal to basically say my sales has to increase by 10 next year but which sales should i look at when you look at it if there is a lack of that data culture or the lack of where you're not looking at a singular governed data set then all of a sudden these kind of problems become more rampant and they exist within organizations and it's not uncommon to have so you have increased total cost of ownership because now you have so many of these self managed data and data sets and then no ability to control so if you have sensitive data that moves across the organization you have no ability to manage or control that and if with things like ccpa that comes in it becomes so important that very sensitive data essentially has to be managed so closely so that if a customer comes and says i need to remove access to this data then you'll have to remove that data out of the system then it's not easy to do if it's all over the place and you have no way of identifying where it exists so i i think that lack of data culture does cause a lot of problems some of them are tangible some of them are not and some of them are visible and some of them are not but it does it causes a lot of issues how do we make data essentially a priority in every conversation with every initiative that takes place we start talking about what are the success metrics for that initiative and how are we going to measure it because you cannot manage what you can't measure and you have to be able to measure that so how do we bring data into that conversation now it may translate to saying that we truly don't have data needs but it is important for the data person to data leader to essentially have a seat at the table so that what that will allow is that okay now you have a whole list of stakeholders who are sitting with you and telling okay this is the initiative that is coming up and then it allows for you to have the conversation and say okay this is what it is going to touch this is what it's going to impact then for them to have a top down conversation where they cascade that information down to their to their leaders and then it goes down all the way to the bottom of the of the enterprise itself now the other thing you have to look at is also bottom-up so the top-down approach sometimes it's not very effective just for the reason that it's represented in a very different way so to ensure that that message is reached you have to ensure that there is also communication from up you show value essentially from what is coming out of your platform what is going to get impacted and see that how you can essentially provide that value for their business unit and our team also and so that needs to be something that you have to be continuously communicating to them so that's one of the ways in which you can ensure that that lack of data culture does not exist within the organization it of course happens in a piecemeal way in some organizations it's easy because they already know what they want in terms of data centralization or it exists it's just that they want to truly democratize data so depending on where you are within the organization there are there's that communication that you have to do either from top down or bottom up and that will help address the issues that basically come with that lack of data culture that the communication is very key that's really great and harping on the semantic layer that we discussed about and how it also empowers the data culture i've seen you speak about this quite a bit and we've mentioned this in our conversation so far is the importance of standardizing business metrics to be able to galvanize a data culture do you mind further expanding into that and how it helps accelerate the data culture within organizations absolutely so let me give an example in one of the organizations that i was working with there was a key metric that was in use within the organization and we had 13 different definitions of that metric 13 not one or two we had 13 different definitions of that organization so you can imagine just what happened that when that metric actually got reported across and that was a metric that was being used to compute to basically as part of our strategy to see that whether we were making progress in our initiative or not so it was very very important that that metric definition become consistent so that we know exactly what number we were starting from and what we were actually driving towards i talked about this so much because this again is something that is rampant across organizations finance for example has their own definition marketing has their own definition sales has their own definition and then so do other business units having their own definitions of each metric so bringing all those stakeholders together and trying to basically understand as to okay we have all of these definitions but as an organization we all need to kind of attach to one or one definition that we can come so that when we are actually communicating it is we are all speaking as one rather than speaking at six or seven different business units and not speaking six or seven different languages so inconsistency in metrics essentially means where the metric is defined differently across different business units and is being used in different forms but it is called the same thing and that leads to a lot of chaos and it's very very important that organizations essentially come to us an alignment in terms of how they truly want to define it that's really great and definitely a nightmare scenario when you have 13 different definitions of the same metric so if you're an organization or data leader looking at such a situation where there is a lot of messy definitions of metrics how do you approach the journey of reaching consistency what does it entail and what does that journey look like so what we do here is so first you start identifying what are the key metrics across your organization and we prioritize them think of it almost like a data journey for you i wouldn't say it's a multi-year journey but it is a journey to essentially get all of this aligned so you need to first understand what are the key metrics that are in use across the organization and specifically focus on metrics that are used across business units so if a single business unit has a metric that they use that's not of too much concern just for the reason that it's used just within that business unit but if there are metrics that are used across business units that's where the concern starts and so you identify basically the list of those metrics you sit in down with the stakeholders and begin a conversation now this is like the toughest part i don't think the build out of the metric or anything is difficult but it is this conversation where you have to get alignment in terms of metrics and the challenge with these conversations typically have been is who is going to take ownership of this metric going forward so you land on a definition but then who becomes that owner who becomes responsible for that metric so this conversation is very very important and very very key to basically have and you bring all of them together and you sit in and you tell them okay we have all of these so let's come now up to an alignment now in this case the data leader essentially is the facilitator or the coordinator of these conversations but you essentially are waiting for the business to make that decision in terms of what is the definition that should be used forward so you document and you take it across to the stakeholders and say that this is what we have now can you tell us that which is the one that should be used once they decide on that singular metric we say okay now who becomes the owner of that metric going forward so defining that data steward or the owner of that metric that person will be responsible now for all communication for that metric going forward so business changes the way we look at the way we pivot our business or the way we look at our customers our product changes for example and the metric definition is likely to change so this person then becomes responsible to say you know what just because of these changes that are happening going forward this is what the metric definition is going to look like and that communication needs to be done they need to get alignment from all the other stakeholders and say okay this is what is this is what it is going to be forward going forward this person is also going to be responsible for that definition in a business glossary so you have a business glossary within the organization so in some cases it's a fancy business glossary like it comes as part of your data governance tool but it can be something as simple as a shared excel document or a shared sharepoint site or a shared confluence document or something where you know that is all defined the definitions are put over there for the organization to basically see so in short basically this person owns the metric end to end and then we just go back repeat this exercise until we cover all the kpis and metrics that are in use within the organization so as i said again it's a journey i wouldn't say necessarily it's multi-year based on the size of the organization and based on the number of metrics that you have but till you identify and get to that point you just go through till you cover all the metrics once you have all the definitions in place then the data leader essentially goes back to his or her team and then goes and has that metric and in its definition put as a metric within the semantic layer so again this is why the cement clear is very very important so if i have something called financial then financial sales becomes a metric within my semantic layout so whoever accesses it going forward will get the exact same value and would you consider this like a massive low hanging fruit that can really accelerate data culture given that it's not a multi-year journey but something that just requires alignment people sitting in a room and this is one of the easiest way a data leader can make an impact in an organization oh my god yes yes i think this is so key and so important and people fail to see that you're absolutely right i like that i like the low hanging fruit yes this is something that can be so so easily achieved and can be done with just some very quick communication between teams and someone just taking ownership of something so simple something which they are already working with and they just take ownership of it so absolutely i think this is something that is very key to bringing at least landing and bringing that data culture much much closer to what you are looking for that's really great and i think this marks a really great segue to discuss kind of the role of the data leader and democratizing data i think the past few years we've definitely seen the role of the data leader whether that's a vp of analytics or data science a cdo a chief data analytics officer it evolved into much more of a culture sword and a change management steward rather than someone who just sets the data agenda of the data team so how do you have how have you seen this evolution over the past few years and what do you think of the data leaders role in democratizing data today so i think that role has evolved in leaps and bounds so if you looked at traditional data leadership essentially the data leader was an order taker so his or her responsibility was just pipe the data into the system and make it available to the end user there was nothing beyond that you attached a reporting tool on top of it and that's about it they were an afterthought typically when initiatives launched or projects launched but over the past few years when people started saying oh my god data is the new oil data is your latest asset and then everything is all data data data and people are like going all completely crazy this role has now completely like 365 and now what has happened is that the data leader has a seat at the table and their responsibility now is just not building those data pipes but now they are actively responsible for you know changing the data culture within the organization making sure that organization is data literate that data is being used very effectively across the organization in words it's basically they're like they become an evangelizer of data and they have to become that change agent which they originally like traditionally were not so now in with basically the just having like a role of where all you had to do was like pure data engineering now you have become like a data strategist your data transformist you have to think about how you're going to monetize data you have to think about you have to pretty much think about everything from a data perspective you have to see that and ensure that the value that the business thinks that they are going to get from the data you have to prove and provide that value to the organization so the role has completely shifted where the onus of all of this is now falling directly on the data leader so whoever is the data leader and who's sitting in the space has a great responsibility and i'm not saying that they were not responsible before it's just that a heavy responsibility has now been placed on them where they have to be so visionary so forward-looking in addition to just being someone who can execute and what do you find our key guiding principles to succeed as a data leader in such a stressful as well as high pressure environment where you have to really decide on and drive the data strategy of the entirety of the organization if you think about guiding principles i think one is communication is a big big big key right so when the leader gets into this role he or she may or may not have access to what they need to do their task effectively they have to go out reach out and ensure that they fully understand what is it that is the goal if it's not something that they understand at this point and what you have to do also is that you have to have as i said a strategy in place that strategy would kind of help you dictate as to what your next steps are going to be what you are actually going to do so that strategy is very important not only like a data strategy but you also have a forward-looking technology strategy for your team and for your organization itself so communication getting that executive support having a strategy in place essentially a lot of what we actually talked about in terms of the components of data democratization become also your principles for what you need to do to basically make your life much much easier and communication holds a big place because you not only have to communicate to your stakeholders continuously providing that value that they are looking for but also to the folks who are your end users and continually telling them in training them and telling them about the value of the data as well so that they adopt your platform so i would think in terms of guiding principles i would probably go back again to my components of data democratization and say that okay these are this is almost like a checklist for me and let me ensure that i'm going and performing each of these steps this should help reduce that stress that data leaders now face and then of course you keep yourself very engaged with the industry attend there are a lot of data and analytics conferences where thought leaders come they share their best practices they share their thoughts and challenges in terms of what's going on where they are challenged and hearing and listening to other people going through the same journey essentially helps you you know you have buddies who are who essentially are going through the same journey as you and you learn a lot essentially coming out of those i can imagine and you mentioned here communication being super key of course the data leader cannot do it all by themselves how do you choose your collaborate collaborators when embarking on these large transformational projects and what does successful collaboration look like in a con context of these data transformation initiatives so i see collaboration happening in multiple forms so again there is this whole piece of where you have a data strategy and you have priorities for your data itself so in that case it becomes a little easier because your collaborators are kind of already defined and you already know that they have strategic initiatives which tie to organizational goals and they become your partners very easily but if you are also building out and you are also looking to see that how can you how can you bring more business into your platform you basically reach out to a business unit so again that's where communication becomes very important you almost have to be i wouldn't say like a know-it-all but you have to be someone who's aware of things that are going on within the organization itself you reach out to business units and say you know what i've heard actually that you'll have this big initiative that you're embarking on and data may be a concern for you is there any way essentially that we can step in and help you out in some cases you are able to get this project funded but in some cases it's almost like you have to prove it as a proof of concept and once you prove it then you have a collaborator for life and that person also becomes your data champion and allows you to promote it so that's one kind of collaboration that you can do the second kind of collaboration that you do is going and talking to the top books you can go down to the bottom folks and you talk to them and sit down with them understand what their challenges are and you do a similar exercise in terms of how do we how can i help you there is opportunity here i see and how can i make this easy for you go to them reach out again it may be funded may not be funded again you do proof of value and again you've been a data champion for life or you've been a collaborator for life i i have done this successfully across my organization and i have seen that one is not only that these data champions become my it's like the train the trainer kind of mode they become my champions for across the organization so there have been meetings where they sit in and i have not sat in and they speak to the capabilities of my platform much better than i ever could and so that becomes an automatic showcase for my platform without me having to say anything so as i said you find collaborators in many forms you just have to be aware of what's going on from a data standpoint around your organization and you go reach out and you just have to go with the attitude that's possible that you're just going to probably get pushed out or you may not get funding but if there is value you know that has to be unlocked because sometimes people don't see the value that has to be unlocked but if you see that value to be unlocked then you almost you force take that thing and you say okay you know what let me prove it to you there's no cost to you i do it for you and you tell me whether this is not going to change your life and almost like 90 to 95 of the time it has changed i think that sees a lot of success and for me if you ask me that's the best way in which data championing or collaborations with others have democratized my data better within the organization that's really great and i love this virtuous cycle model that you're proposing where you start off with the low-hanging fruit it creates more evangelism of the data generates a lot more opportunities to do a lot more low-hanging fruits and it's like a cross-pollination of data within the organization do you mind sharing to a certain extent successful low-hanging fruits projects that you've worked on that were able to create data champions and how do you prioritize which projects to go after first in your data journey to be able to create this momentum and keep it going yeah i have a couple of use cases that i could actually share so in this one organization of ours we had this whole team of marketing analysts essentially who were there and one of those analysts essentially was spending 70 of his week on doing this repetitive task okay so he was given an exercise and spent 70 percent of his week doing all of this manually then again he repeats that same thing 70 percent of the week because the parameters have changed and or shifted and it's a challenge so you have this really intelligent individual who is here and that person is doing a manual task spending 70 on his time on probably something that he was not actually hired for originally but there is so much more value he can actually provide to the organization so we reached out first to his manager and then eventually to him and we said okay we can help support you and we can actually automate this whole function out for you and he looked a little worried and he's like there's no way this is so complex and everything we said why don't you just give us a chance and we can show basically what you can actually do so it took us a good two and a half to three weeks to basically get everything that he was doing manually automated and then not only did we like automate what he had to do but we also gave him the capability that tomorrow if he has to change or shift the parameters or variables within that that set he can do that with just the click of a button he saved not only that 70 percent of the time but just because he had all this additional time he got to work on more fun stuff and he was so happy at the end of all of this that he became like our champion for life he became our key mouthpiece in all meetings he would just keep saying oh if there's anything that you'll want to get done this is the team who's going to do it for you it my team's work was done because we were like okay we don't have to go and market ourselves anymore we don't have to prove value anymore we don't have to showcase anymore we have someone who is going to do this automatically for us it's so simple right as you very nicely said low hanging fruit these are like low hanging fruit that you just have to go figure out again you know they exist and go take your opportunity and go after it in another organization i was working there was a very critical problem of where they were doing this computation of on-hand inventory manually and it was at a very very very granular level so there was so much so much data to crunch that and again since they were doing it manually they were able to manage it only twice a year it's a very critical thing that needs to be done actually at a weekly level but one given just the size and nature of the data and the fact that they had to do it manually just allowed them to do it twice a year they could not do beyond that even that twice a year was like a nightmare for them so again we intervened we got into play and they were a very very very skeptical team very skeptical and again we said free of charge we'll just go and prove it to you and then if it works all you do is basically have this thing run on our system going forward and we went we built it in into our new platform so not only then did we save on resource efficiencies now basically the process even ran weekly as it was originally this basically desired so that was what the actual intent of that work was and again freed up a lot of time and then they could actually focus on other things that were sitting on their plate which they were not able to focus on because of this whole nightmarish thing they were doing so that cheta champion again became so happy that he worked with us to eliminate that data silo and the reporting tools his team was using so he said you know what we don't need this your platform is the one that we want to start building everything on so when it came to migrating everything from his tools and his platform on he became that champion for his team he went and spoke to his leadership and said that this is what we need to do and we need to move out of this so again easy low hanging fruit something we can all get to and something easy to do as a as a leader as a data leader as i said you have to be like a communicator and an evangelist and as long as you are able to get to the root of the problem and then figure a way to help fix it for them you are going to be able to successfully ensure that your organization is using data in the right way and they adopt your platform which essentially was just built for that purpose that's really great and i love these stories especially when you showcase the enthusiasm that it created now as a data leader you want to balance out between the low-hanging fruit and the strategic long-term projects and even the more technically demanding data science projects that require machine learning highly complex algorithms and tools the only subject matter experts like data scientists data engineers machine learning engineers know how to master how do you balance out that roadmap between short-term and long-term wins and what are some of the exciting kind of long-term projects that you've worked on as well awesome so this is how i manage and this is specifically for me how i manage so the way i resource and the way i think about it is 70 i work on strategic initiatives 10 is essentially ad hoc where my low hanging fruit falls and any other ad hoc needs and or requests that just pop in and then 20 is technical debt okay so we do accumulate technical debt and stuff starts getting older and then how do we keep it always new and fresh so that's how i do the breakdown now in some cases what happens is that just because of timelines one becomes ninety percent and then the others just become a little lower but overall as an at an average i maintain it like this 70 10 20 um percentages so just a little it just allows me to say okay i have managed and we should be able to get all of these things done and it for me it has worked basically pretty effectively now in terms of use cases that so there are a lot of use cases that i exciting ones that i've actually worked on but one that i was very extremely proud of not to say i was very excited about it but i was extremely very proud of was when in one of the organizations i worked with where we migrated to the aws cloud redshift had basically just come out i think it was about eight months old so we were almost going to be close to they probably were all having uh beta customers and not production customers and we were going to be one of the first ones going into like going into the aws redshift and the interesting thing was that my team had basically traditional data engineers including myself so we hadn't had access to cloud technologies and he hadn't done that and i created a training plan with one and a half months all of my team became like cloud i wouldn't say experts but we became we knew everything there was to know about the aws cloud and we probably could talk about that in a separate session in terms of how we did that but i was so surprised at how my team just they just all picked it up and in six weeks we were all ready we built out the whole architecture our whole architecture okay and this was not only the batch architecture we built a full streaming architecture in the cloud and we launched on time of course there was a lot of money savings because we were migrating out of a very expensive platform as well and then of course aligning with the broader technology strategy for the organization and then in addition what we were able to do is that we were able to build a fraud detection model on this platform and which would actually run in near real time and then spit out essentially back to the application it would spit out a score to say that whether this transactional was going to be fraudulent or not and that saved so much of the resources on the risk management team because otherwise risk was going through these things actually manually so our ability to basically be able to score the transactions and we had about a 92 percent accuracy i made it all worthwhile so a lot of exciting use cases to be worked on but this one i'm very particularly proud about that is so nice and as we end on a such an inspirational note talking about kind of the the value of creating a training plan do you have any final call to action me now before we wrap up i'll probably repeat what i said when you asked me to define data democratization so again data democratization is more than just bringing data all together into a central location so it's not only about solving data silos or centralizing data right it has to be it has to be truly an interface that the user can become self so it has to be an interface where the organization truly can say yes we are moving in a data driven fashion it has to be an interface in which you provide comfort you provide the tools you provide the access and then you provide all the processes the back-end processes of governance and privacy and all of that and help basically the organization succeed and becoming a data-driven environment so i think for me in terms of call to action i tell all data leaders is that focus on what that end view for the customer should look like rather than just focusing on piping data in and just bringing all that data into a central location thank you so much minelle for coming on the podcast i really appreciate it thank you so much adele thank you for letting me speak you've been listening to data framed a podcast by data camp keep connected with us by subscribing to the show in your favorite podcast player please give us a rating leave a comment and share episodes you love that helps us keep delivering insights into all things data thanks for listening until next time\n"