
Components of Data Science Life Cycle





Data science continues to evolve as one of the most promising and in-demand careers of the 21st century.
The insights drawn from data are useful and profitable for businesses when the data is processed with intelligent algorithms that find patterns in it.

Data science follows a life cycle that defines what happens to the data at each stage, so that it is processed into a more informative and easier-to-use form. The data science life cycle consists of five stages, and each stage has its own tasks that are performed on the data.

                             
                          
                                           Fig:- Components of Data Science Life Cycle

 
  The five components of the Data Science Life Cycle are:

  1. Data Capturing

Data is captured from different sources (including data entry and extraction) so that results can be derived from it after pre-processing.

      The tasks performed during the Data Capturing stage are:

     -  Data Acquisition
     -  Data Entry
     -  Data Extraction
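
As a minimal sketch of the extraction task, assume the raw data arrives as CSV text. The sample records and the `extract_records` helper below are hypothetical and only illustrate the idea, using Python's standard `csv` module:

```python
import csv
import io

# Hypothetical raw data, standing in for a file or an API response.
RAW_CSV = """city,temperature
Pune,31.5
Delhi,
Mumbai,29.0
"""

def extract_records(text):
    """Parse CSV text into a list of dicts, one per row."""
    reader = csv.DictReader(io.StringIO(text))
    return [dict(row) for row in reader]

records = extract_records(RAW_CSV)
for record in records:
    print(record)
```

Note that the missing temperature for Delhi comes through as an empty string; handling such gaps belongs to the next stage.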

   2. Data Maintain

Maintaining the data is often required when we handle a variety of data, and the dataset provided for analysis may be staged in different formats. This stage maintains the data and makes it available for processing and analysis. Pre-processing follows, which includes data cleaning, removing NaN values or replacing them with the average value of the column (if necessary), outlier removal, etc.

       The Data Maintain stage of the data science life cycle includes:
      
     - Data Cleansing
     - Data Staging
     - Data Warehousing
     - Data Processing
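
The cleaning steps mentioned above (replacing NaN values with the column average and removing outliers) can be sketched as follows. This is only an illustration under stated assumptions: the sample column is made up, the function names are hypothetical, and the 1.5 × IQR rule is one common outlier criterion that the article does not itself prescribe. Outliers are dropped first here so that an extreme value does not distort the average used for filling.

```python
import math
import statistics

def remove_outliers_iqr(values, k=1.5):
    """Drop values more than k * IQR outside the quartiles; NaN entries are kept."""
    present = [v for v in values if not math.isnan(v)]
    q1, _, q3 = statistics.quantiles(present, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if math.isnan(v) or low <= v <= high]

def fill_missing_with_mean(values):
    """Replace NaN entries with the mean of the non-missing values."""
    present = [v for v in values if not math.isnan(v)]
    mean = statistics.fmean(present)
    return [mean if math.isnan(v) else v for v in values]

# Made-up column with one missing value and one obvious outlier (95.0).
column = [10.0, 12.0, float("nan"), 11.0, 13.0, 12.0, 11.0, 11.0, 95.0]
trimmed = remove_outliers_iqr(column)
cleaned = fill_missing_with_mean(trimmed)
print(cleaned)
```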


  3. Data Processing

Data may or may not be in a proper format (i.e. structured data), so we have to apply various techniques to process it until it is ready for analysis. Processing includes data modelling, data summarization (a complete summary of the structured data), and data clustering and classification into groups.

     The data processing stage includes:

     - Data Modelling
     - Data Classification
     - Data Summarization
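
To make summarization and classification concrete, here is a small sketch. The scores, the threshold and both function names are assumptions made for illustration, and the "classification" is a simple rule-based grouping rather than a trained classifier.

```python
import statistics
from collections import defaultdict

def summarize(values):
    """Return a small summary of a numeric column."""
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": statistics.fmean(values),
        "median": statistics.median(values),
    }

def classify(values, threshold):
    """Group values into 'low' and 'high' around a hypothetical threshold."""
    groups = defaultdict(list)
    for v in values:
        groups["high" if v >= threshold else "low"].append(v)
    return dict(groups)

scores = [42, 55, 61, 70, 88]
summary = summarize(scores)
groups = classify(scores, threshold=60)
print(summary)
print(groups)
```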

  4. Data Analyze

Analyzing the data and finding the key insights is one of the most challenging and decisive steps. The analyst applies various statistical tests and algorithms to derive patterns and insights from the data, and then tells the story of what the analysis found.

       The tasks performed during the Data Analyze stage of the data science life cycle include:

      - Exploratory Analysis
      - Predictive Analysis
      - Regression
      - Qualitative Analysis
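
As one example of the regression task listed above, the sketch below fits a straight line by ordinary least squares. The data (study hours against marks) is invented for illustration and `fit_line` is a hypothetical helper; a real analysis would typically rely on a statistics library and check how well the line fits.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums around the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

hours = [1, 2, 3, 4, 5]
marks = [52, 54, 56, 58, 60]  # made-up data lying exactly on 2 * hours + 50
slope, intercept = fit_line(hours, marks)
predicted = slope * 6 + intercept  # prediction for 6 hours of study
print(slope, intercept, predicted)
```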

  5. Data Communication

Data communication plays a key role in the data science life cycle. After analyzing the data, the main task is to represent and visualize the insights so that everyone understands what the data says (its insights and patterns). Decisions are then made accordingly.

      The Data Communication stage includes:

    - Data Visualization
    - Data Reporting
    - Decision Making
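
Even a very simple visual can communicate a result. The sketch below renders a plain-text bar chart; the sales figures and the `text_bar_chart` helper are hypothetical, and in practice a plotting library would usually be used instead.

```python
def text_bar_chart(counts, width=20):
    """Render a dict of label -> value as horizontal text bars."""
    largest = max(counts.values())
    lines = []
    for label, value in counts.items():
        # Scale each bar relative to the largest value.
        bar = "#" * round(value / largest * width)
        lines.append(f"{label:<8} {bar} {value}")
    return "\n".join(lines)

sales = {"North": 120, "South": 60, "East": 30}  # made-up figures
chart = text_bar_chart(sales)
print(chart)
```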

So these are the components of the Data Science Life Cycle. Each stage requires particular specialization and experience to carry out the processes involved and to turn the data into a story. The components of the data science life cycle can further be combined with the software development process, helping data scientists and software engineers build complete machine-learning-based applications powered by data science.


      
      
