logo_ds

Module 2: Not So Scary Wrangling (Table Manipulation and Chaining)

In this module, you will learn how to import different types of files, perform more advanced table manipulations (modifying and creating new columns) as well as method chaining conventions (style, including multi-line).

0Module Learning Outcomes

1Reading in Different File Types

2Delimiter

3Reading in a URL

4Reading in a Text File

5Reading in an Excel File

6Arguments for Reading Data

7Name that Argument!

8Using Arguments when Reading in Files

9Column Renaming and Dropping

10Column Editing Questions

11Renaming a Column Index

12Droping Columns in a Dataframe

13Column Arithmetic and Creation

14 Column Arithmetic Questions

15Creating a New Column

16Data Filtering

17Filtering Question

18Single Condition Filtering

19Filtering using "and"

20Conditional Value Replacement

21Practice Replacing Values

22Chaining Notation

23Chaining True/False

24Practice Chaining

25Grouping and Aggregating

26Fruit Salad Grouping and Aggregating

27Practice Grouping

28Practice Aggregating

29Plotting with Altair

30Plotting a Groupby Object

31What Did We Just Learn?

About this course

Learn the fundamentals of programming in Python, including how to clean, filter, arrange, aggregate and transform data. You will learn the foundations of programming in Python while writing human-readable code that sets a foundation of best practices and coding style. You will gain the skills to clean, filter, manipulate (wrangle) and summarize data using Python libraries for more effective data analysis. An overview of data structures, iteration, flow control and program design relevant to data exploration and analysis will be addressed along with fundamental programming concepts such as loops, conditionals and data structures that create a solid foundation in data science programming.

About the program

The University of British Columbia (UBC) is a comprehensive research-intensive university, consistently ranked among the 40 best universities in the world. The Key Capabilities in Data Science program was launched in September 2020 and is developed and taught by many of the same instructors as the UBC Master of Data Science program.