Components of Data Science Life Cycle
Data Science continues to evolve as the one of the most promising and demanding career of 21st century.
The insights drawn from the data is very much useful and profitable for the businesses when processed with intelligent algorithms to find pattern and insights from it.
The complete Data science follows a life cycle pattern which defines the steps of each stage of data and apply them to make it processed in more informative and easier way. The components of Data Science life cycle consist of five stages. Each stage have different tasks which perform on data during complete life-cycle span of Data science.
Fig:- Components of Data Science Life Cycle
The 5 components of Data Science Life Cycle are:-
1. Data Capturing
Capture of Data from different sources such that we derive some result from it after pre-processing the data.(including entry and extraction)
The task performed during complete span of Data Capturing is:
- Data Acquisition
- Data Entry
- Data Extraction
2. Data Maintain
Maintaining the data is often required when we handle with varieties of data and even the dataset provided for analysis is staged in different format. Maintaining the data and makes it available for process and analysis is done at this stage. Pre-processing is done just after which includes data cleaning, removal or replacement of Nan values with the average value of complete column (if necessary), outliers removal, etc.
The Data Maintain stage of Data science life-cycle includes:-
- Data Cleansing
- Data Staging
- Data Warehousing.
- Data Processing
3. Data Processing
Data may or may not be in proper format (i.e. structured data). So we have apply various techniques to processing the data such that it becomes prepared for analysis. Processing includes data modelling,data summarization (complete summary from the format structure data),data clustering and classification in various groups.
The data processing stage includes:
- Data Modelling
- Data Classification
- Data Summarization
4. Data Analyze
Analyze the data and finding the key insights is one of the challenging more decisive process. To analyze the data various various statistics test and algorithms performed by analyst to derived thepattern and insights from the data and the do storytelling about the analysis find from it.
The task performed during Data Analyze stage of data science life-cycle includes:
- Exploratory analyze
- Predictive Analysis
- Regression
- Qualitative Analysis
5. Data Communication
Data communication plays an key importance in data science life cycle. After analyze the data the main thing is to represent and visualize the insights such that everyone understand about what the data tells (insights, pattern) and its visual representation. After that decision making performed accordingly
The Data Communication stage includes:
- Data Visualization
- Data Reporting
- Decision Making
So this are the components Data Science Life Cycle. So at each stage of data science life cycle requires particular speciality and experiences to perform the process involved at each level and makes the data a story telling chapter. The components Data science Life cycle further combined with software development process and helps the data scientist and software engineers to develop the complete machine-learning based applications powered by Data science.
Comments
Post a Comment