This is an archived version of the course. Please find the latest version of the course on the main webpage.

Chapter 3: Understanding your data

How many instances?

Josiah Wang

We will explore the dataset (and scikit-learn) by trying to answer a series of questions. These are the kinds of questions that you should be asking yourself each time you are faced with a new dataset!

Question 1: How many instances/samples are there?

x and y should have the same length (one entry per instance), so checking either one gives the answer. The final assert confirms that the two agree.

>>> print(len(x))
150
>>> print(len(y))
150
>>> assert len(x) == len(y)
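
If you find yourself repeating this check for every new dataset, it could be wrapped in a small helper. The following is only a minimal sketch: the function name count_instances and the toy data are hypothetical and not part of the course material. It relies only on len(), so it should work for plain lists as well as NumPy arrays.

```python
def count_instances(x, y):
    """Return the number of instances, checking that x and y agree in length."""
    if len(x) != len(y):
        # Raise instead of assert, so the check still runs under `python -O`.
        raise ValueError(f"x has {len(x)} instances but y has {len(y)}")
    return len(x)


# Toy stand-in data (not the course dataset): 3 instances, 2 features each.
toy_x = [[5.1, 3.5], [4.9, 3.0], [6.2, 2.9]]
toy_y = [0, 0, 1]

n_instances = count_instances(toy_x, toy_y)
print(n_instances)
```

When x is a NumPy array, x.shape[0] gives the same number as len(x).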