3. Data Analytics - Data, Data, Everywhere - Week 3

Definition:

Database // a collection of data stored in a computer system

Stakeholders // people who invested time and resources into a project and are interested in the outcome

Formula // set of instructions that perform a specific calculation using the data in a spreadsheet

Function // a preset command that automatically performs a specific process or task using the data in a spreadsheet

Query Language // computer programming language that allows you to retrieve and manipulate data from a database. EX: SQL (Structured Query Language)


Data Analysis Tools:

    - Spreadsheets; digital worksheet that stores, organize and sorts data. Google Sheets and Excel

    - Databases

    - Query Languages; SQL

    - Visualization Software; Tableau and Looker


Data Lifecycle:

    - Plan; what data is needed and who is responsible for collecting, storing, sharing data

    - Capture; when data is collected from different sources (outside or inside resources)

    - Manage; how to store, secure, and maintain properly

    - Analyze; data is use to solve problems, make decisions, support business goals

    - Archive; storing data where it is available but may not be used again

    - Destroy; using erasure software or shred physical data


Data Analysis Process:

    1. Ask; define the problem to be solved, the purpose, and understand stakeholder's expectations

            Look at current state and identify how it's different from the ideal state.

            Determine who stakeholders are. Manager? Sponsor? Sales partner?

    2. Prepare; what type of data we need to answer questions, brand new or old data, collect or dont.

            (quantitative data vs qualitative data)

            (cross sectional or points in time or longitudinal over a long period of time)

    3. Process; eliminate errors and inaccuracies that can affect the result.

            Cleaning data and transforming it into a more useful format.

            Removing outliers or combining datasets.

    4. Analyze; using tools to transform and organize the data.

            Use data to draw conclusions, predictions, and drive informed decision-making.

            Do it objective and unbiased way.

    5. Share; interpret results and share to others in effective ways

            Visualizations is important.

            R Language for data manipulation, calculation, and visualization.

    6. Act; business takes all insights from data analysts to solve business problems


Maintaining Data Requires:

    - Integrity

    - Credibility

    - Privacy


Additional Resources:

https://www.fws.gov/data/life-cycle

https://www.usgs.gov/data-management/data-lifecycle

https://sfmagazine.com/post-entry/july-2018-the-data-life-cycle/

https://online.hbs.edu/blog/post/data-life-cycle

Comments

Popular posts from this blog

2. FreeCodeCamp - Dynamic Programming - Learn to Solve Algorithmic Problems & Coding Challenges

20. Data Analytics - Analyze Data to Answer Questions - Week 1

3. Algorithms - Selection Sort