3. Data Analytics - Data, Data, Everywhere - Week 3
Definition:
Database // a collection of data stored in a computer system
Stakeholders // people who invested time and resources into a project and are interested in the outcome
Formula // set of instructions that perform a specific calculation using the data in a spreadsheet
Function // a preset command that automatically performs a specific process or task using the data in a spreadsheet
Query Language // computer programming language that allows you to retrieve and manipulate data from a database. EX: SQL (Structured Query Language)
Data Analysis Tools:
- Spreadsheets; digital worksheet that stores, organize and sorts data. Google Sheets and Excel
- Databases
- Query Languages; SQL
- Visualization Software; Tableau and Looker
Data Lifecycle:
- Plan; what data is needed and who is responsible for collecting, storing, sharing data
- Capture; when data is collected from different sources (outside or inside resources)
- Manage; how to store, secure, and maintain properly
- Analyze; data is use to solve problems, make decisions, support business goals
- Archive; storing data where it is available but may not be used again
- Destroy; using erasure software or shred physical data
Data Analysis Process:
1. Ask; define the problem to be solved, the purpose, and understand stakeholder's expectations
Look at current state and identify how it's different from the ideal state.
Determine who stakeholders are. Manager? Sponsor? Sales partner?
2. Prepare; what type of data we need to answer questions, brand new or old data, collect or dont.
(quantitative data vs qualitative data)
(cross sectional or points in time or longitudinal over a long period of time)
3. Process; eliminate errors and inaccuracies that can affect the result.
Cleaning data and transforming it into a more useful format.
Removing outliers or combining datasets.
4. Analyze; using tools to transform and organize the data.
Use data to draw conclusions, predictions, and drive informed decision-making.
Do it objective and unbiased way.
5. Share; interpret results and share to others in effective ways
Visualizations is important.
R Language for data manipulation, calculation, and visualization.
6. Act; business takes all insights from data analysts to solve business problems
Maintaining Data Requires:
- Integrity
- Credibility
- Privacy
Additional Resources:
https://www.fws.gov/data/life-cycle
https://www.usgs.gov/data-management/data-lifecycle
https://sfmagazine.com/post-entry/july-2018-the-data-life-cycle/
https://online.hbs.edu/blog/post/data-life-cycle
Comments
Post a Comment