graphical user interface
Technical, Data Analytics

Course 5: Analyse Data to Answer Questions, Module 1: Organise data for more effective analysis

Organising data makes the data easier to use in your analysis. In this part of the course, We’ll learn the importance of organising your data through sorting and filtering. We’ll explore these processes in both spreadsheets and SQL as you continue to prepare your data. Learning Objectives Course content Course 5 – Analyse Data to Answer Questions Module 1: Organise…

Continue Reading

Technical, Data Analytics

Course 4: Process Data from Dirty to Clean, Module 2: Clean it up

What is dirty data? Earlier, we discussed that dirty data is data that is incomplete, incorrect, or irrelevant to the problem you are trying to solve.  This section summarizes: Types of dirty data Duplicate data Description Possible causes Potential harm to businesses Any data record that shows up more than once Manual data entry, batch data imports, or data migration…

Continue Reading

Technical, Data Analytics

Course 4: Process Data from Dirty to Clean, Module 1: The importance of integrity

Scenario: calendar dates for a global company Calendar dates are represented in a lot of different short forms. Depending on where you live, a different format might be used.  Now, think about what would happen if you were working as a data analyst for a global company and didn’t check date formats. Well, your data integrity would probably be questionable.…

Continue Reading

Technical, Data Analytics

Course 3: Prepare Data For Exploration, Module 4: Organise and Secure Data

File organisation guidelines Every data analyst’s goal is to conduct efficient data analysis. One way to increase the efficiency of your analyses is to streamline processes that help save time and energy in the long run. Meaningful, logical, and consistent file names help data analysts organise their data and automate their analysis process. When you use consistent guidelines to describe…

Continue Reading

Technical, Data Analytics

Course 3: Prepare Data For Exploration, Module 3: Database Essentials

Maximise databases in data analytics Databases enable analysts to manipulate, store, and process data. This helps them search through data a lot more efficiently to get the best insights.  Relational databases A relational database is a database that contains a series of tables that can be connected to form relationships. Basically, they allow data analysts to organise and link data…

Continue Reading

Technical, Data Analytics

Course 3: Prepare Data For Exploration, Module 2: Data responsibility

Data Responsibility Rundown Key Learnings: Specific Topics Covered: Data anonymization What is data anonymization? We have been learning about the importance of privacy in data analytics. Now, it is time to talk about data anonymization and what types of data should be anonymized. Personally identifiable information, or PII, is information that can be used by itself or with other data to…

Continue Reading