graphical user interface
Technical, Data Analytics

Course 5: Analyse Data to Answer Questions,  Module 4: Perform Data Calculations

Calculations are a common task for data analysts. In this part of the course, you’ll explore formulas, functions, and pivot tables in spreadsheets and queries in SQL, all of which will help with your calculations. You’ll also learn about the benefits of using SQL to manage temporary tables. Learning Objectives Functions with multiple conditions Conditional functions and formulas perform calculations…

Continue Reading

Technical, Data Analytics

Course 4: Process Data from Dirty to Clean, Module 2: Clean it up

What is dirty data? Earlier, we discussed that dirty data is data that is incomplete, incorrect, or irrelevant to the problem you are trying to solve.  This section summarizes: Types of dirty data Duplicate data Description Possible causes Potential harm to businesses Any data record that shows up more than once Manual data entry, batch data imports, or data migration…

Continue Reading

Technical, Data Analytics

Course 4: Process Data from Dirty to Clean, Module 1: The importance of integrity

Scenario: calendar dates for a global company Calendar dates are represented in a lot of different short forms. Depending on where you live, a different format might be used.  Now, think about what would happen if you were working as a data analyst for a global company and didn’t check date formats. Well, your data integrity would probably be questionable.…

Continue Reading

Technical, Data Analytics

Course 3: Prepare Data For Exploration, Module 4: Organise and Secure Data

File organisation guidelines Every data analyst’s goal is to conduct efficient data analysis. One way to increase the efficiency of your analyses is to streamline processes that help save time and energy in the long run. Meaningful, logical, and consistent file names help data analysts organise their data and automate their analysis process. When you use consistent guidelines to describe…

Continue Reading

Data Analytics, Technical

Course 3: Prepare Data For Exploration, Module 3: Database Essentials

Maximise databases in data analytics Databases enable analysts to manipulate, store, and process data. This helps them search through data a lot more efficiently to get the best insights.  Relational databases A relational database is a database that contains a series of tables that can be connected to form relationships. Basically, they allow data analysts to organise and link data…

Continue Reading

Technical, Data Analytics

Course 3: Prepare Data For Exploration, Module 2: Data responsibility

Data Responsibility Rundown Key Learnings: Specific Topics Covered: Data anonymization What is data anonymization? We have been learning about the importance of privacy in data analytics. Now, it is time to talk about data anonymization and what types of data should be anonymized. Personally identifiable information, or PII, is information that can be used by itself or with other data to…

Continue Reading

Technical, Data Analytics

Course 3: Prepare Data For Exploration, Module 1: Data types and structures

Select the right data Following are some data-collection considerations to keep in mind for your analysis: How the data will be collected Decide if you will collect the data using your own resources or receive (and possibly purchase it) from another party. Data that you collect yourself is called first-party data. Data sources If you don’t collect the data using…

Continue Reading