Creating tables

Using IBM® SPSS® Data Collection Survey Reporter, you can create multiple tables to analyze your data and generate results for different combinations of variables.

How table sequence affects population time

The table sequence directly affects the table population time. To understand this, we must first understand aggregation. When aggregating data, TOM scans data from beginning to end. The scanning is based on the HDATA structure. When variables on several tables are at the same level structure, and share the same bottom level, data aggregation is accomplished in a single pass.

What is the same level structure?

Using household.mdd as an example, person[..].trip[..].country and person[..].gender are at the same level structure because the trip level is the child of the person level. All of the variables under the two levels are at the same level structure. All variables at the top level are considered to be at the same level structure as the other levels.

Although person[..].trip[..].country and person[..].gender are at the same level structure, their level is actually different. The level for person[..].trip[..].country is person[..].trip while the level for person[..].gender is person.

What is bottom level?

A table contains several variables on the side and top. Each variable has its own level, and the lowest level is regarded as the bottom level. For example, the bottom level for the following table is person[..].trip.

Person[..].trip[..].gender>region*person[..].occupation
VariableLevel
Person[..].trip[..].gender person[..].trip
RegionHDATA
Person[..].occupationperson

The lowest level among HDATA, person, person[..].trip is person[..].trip.

How should tables be organized?

If tables that have the same bottom level are grouped together, you can aggregate the table values in a single pass. For example, the following sequence of tables requires four passes in order to aggregate data:

Table1region*housetype
Table2region*person[..].occupation
Table3region* tenure
Table4region*person[..].gender

Now examine the following sequence of tables that only requires two passes:

Table1region*housetype
Table2region* tenure
Table3region*person[..].occupation
Table4region*person[..].gender

Assuming that each pass takes one minute to complete, the first table sequence will take four minutes to populate while the second will take two minutes to populate.