Dataset

From Rice Wiki
Revision as of 06:22, 26 April 2024 by Rice

In machine learning, a model is trained and evaluated on a dataset: a collection of observations, each described by a set of features.

Attributes of a dataset

The completeness of a dataset is the extent to which it contains all the features relevant to a given task.

The size of a dataset is the number of observations it contains. A dataset needs enough observations for the task at hand.

The validity of a dataset is how accurate, clean, and relevant its data is.

A dataset can be high dimensional, meaning that it has a very large number of features, which can make computation difficult.
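
As a rough illustration of these attributes, the sketch below computes the size, the number of features, and the fraction of missing values for a small made-up dataset. The feature names, the values, and the helper `summarize` are all hypothetical. The missing-value fraction is only a crude proxy for how clean the data is: it says nothing about accuracy or relevance, and completeness cannot be measured from the data alone because it depends on the task.

```python
# Toy dataset: each row is one observation, each key is a feature.
# None marks a missing value. All names and values here are made up.
rows = [
    {"height_cm": 170.0, "weight_kg": 65.0, "age": 30},
    {"height_cm": 182.0, "weight_kg": None, "age": 41},
    {"height_cm": None, "weight_kg": 58.0, "age": 25},
    {"height_cm": 165.0, "weight_kg": 72.0, "age": 37},
]


def summarize(rows):
    """Return the size, feature count, and fraction of missing cells."""
    # Collect every feature name that appears in any observation.
    features = sorted({name for row in rows for name in row})
    size = len(rows)
    total_cells = size * len(features)
    # A cell counts as missing if the feature is absent or set to None.
    missing = sum(
        1 for row in rows for name in features if row.get(name) is None
    )
    return {
        "size": size,
        "num_features": len(features),
        "missing_fraction": missing / total_cells if total_cells else 0.0,
    }


summary = summarize(rows)
print(summary)
```

Here the dataset has four observations and three features, so it is far from high dimensional. A dataset with thousands of features and only a few observations would give a very different summary.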