Basic Concepts |
SAS Data
Sets |
Data
Portion
The data portion of a SAS data set is a collection of data values arranged in a rectangular table. In the example below, the name Jones is a data value, the weight 158.3 is a data value, and so on. |
Data portion |
|
Observations (Rows)
Rows (called observations) in the data set correspond to records or data lines in a raw data file or external database. An observation is the information about each entity in a SAS data set. The values Jones, M, 48, and 128.6 make up a single observation. |
Observation |
|
The data set shown above has four observations, each containing information about an individual. A SAS data set can store any number of observations. |
Variables (Columns)
Columns (called variables) in the data set correspond to
fields in a raw data file or external database. A variable is the set of
data values that describes a given characteristic. The values Jones,
Laverne, Jaffe, and Wilson make up the variable
|
Variable |
|
The data set above contains four variables, or categories
of information, about each person: Name , Sex ,
Age , and Weight . A SAS data set can store thousands
of variables. Only the capacity of your storage device limits the number
of variables in your SAS data sets. |
Missing Values
The rectangular arrangement of rows and columns in a SAS data set implies that every variable must exist for each observation. If a data value is unknown for a particular observation, a missing value is recorded in the SAS data set. |
Missing
value |
|
Copyright © 2002 SAS Institute Inc., Cary, NC, USA. All rights reserved.