Raster Data
Raster data are stored in the computer as a matrix. The cells are referenced by lines and elements . In the simplest form, each line is a computer record. Each record will contain the values for all elements in the line. Any cell not containing a feature would have the value of "0". In the simplest raster system, the value stored for each cell is the attribute component of the geographic data.
In more sophisticated raster systems, the cell value is a label that will link to records as an attribute file. In the above example, cells labeled as "1" could have many attributes, such as species composition, age of forest stand, and estimated volume of marketable timber.
![]() |
| -- Raster data are stored in computers as a matrix. Each cell is referenced by its line and element number. The example shown is for a small file with 10 lines and 10 elements. Cells A is located at line 3, elements 2. Cell B is located at line 6, element 8. |
![]() |
| -- Land cover as represented in a simple raster system. Cells with a "1" are forests, cells with a "2" are croplands, and cells with a "3" are rangelands. |
Because the raster system is strictly a two-dimensional matrix, various types of geographical data are stored as different layers or overlays in the GIS . One layer may contain land use/land cover, another layer may contain wetland data, and another layer may contain information on the transportation system.
![]() |
| -- Various types of geographic data may be stored as different layers or overlays in a GIS. |
The user of a raster system must determine the size of the cells to be used. This size is referred to as spatial resolution . The cell size can vary tremendously depending upon the size of the study area and the objectives for the GIS. Cell sizes as large as 20 ha for state or regional planning may be adequate. For a wildlife management area, a cell size of 0.05 ha or smaller might be required, depending upon the application of the GIS and the size of the wildlife management area. Storage requirements increase drastically as the cell size is reduced. Reducing the cell size by one-half will increase the data storage requirements by a factor of four. Conversely, as cell size increases, the precision of the representation of the land feature is reduced. Choosing the appropriate cell size for a particular GIS application is a compromise between cost of data storage and computer time and reliability of the representation of the land feature.
Vector Data
Vector data provide for high precision in representing the location of features. Aronoff (1989) described how vector data can be used to define the location of a point, a line, and an area. A point is represented by a simple pair of coordinates. The line is represented by an ordered list of pairs of coordinates. The area is represented as a polygon with ordered pairs of coordinates that close the polygon (the first and last pair being the same).
The coordinates can be any arbitrary units but usually are stored as UTM, state plane, or latitude and longitude coordinates. The first vector system used simple techniques to store the X and Y coordinates for polygons. In this simple system the coordinates for the common boundary between two areas were stored twice, once for the first area and again for the adjacent area. These duplicate storage techniques simplified computations and plotting but wasted storage space and, more importantly, provided no information as to adjacency or connectivity of geographic features (topology). Most vector systems now use topological models (Aronoff 1989) for representing the location of areas.
In topological models (Fig. 6), a polygon is defined by a series of arcs. Arcs begin and end at nodes, which occur wherever two or more arcs meet. Each arc is defined by a series of coordinates, starting with the coordinates for the beginning node and ending with the coordinates for the ending node. Topological relationships are stored in three tables. The polygon topology table describes the arcs that bound each polygon, the node topology table describes the arcs that end at each of the nodes, and the arc topology table describes which end points (nodes) occur on each arc and which polygons are to the left and right of each arc. These three topology tables provide the tools required to efficiently determine the positional relationships of one feature to other features. A coordinate table defining the coordinates for each arc also is used in topological models. In addition to these topological databases, the attributes for the features are stored in an attribute database.

-- In vector systems that use topological models, polygons are represented by a list of arcs (adopted from Aronoff 1989). The arcs required to defing each polygon are shown in the polygon topology table. The node topology table defines the arcs associated with each node. The arc topology table describes the starting and end nodes for each arc and defines the polygons to the left and right of each arc.






















