GIS 5007 - Data Classification

 Data classification is important when presenting data. It breaks raw data down into a more palatable and understandable format for the reader. It can also provide insight on and highlight patterns that may not have previously been visible. As such, this week we learned about various data classification methods. In the first instance we were introduced to the various types of data. They include qualitative data, which differentiates types of things and focuses more on characteristics, and quantitative data, which speaks to amount or magnitude. Once we learned of the various types of data we moved on to ways that data can be expressed, specifically via classification. The methods included:

  1. Equal Interval - classes are equal in range. 
  2. Natural Breaks - classes based on naturally occurring breaks in the data.
  3. Quantile - classes contain an equal amount of data in each class.
  4. Standard deviation - based on the mean and classes are formed based on deviation from the mean.
  5. Optimal Classification Method - classes include similar data values by minimizing an objective measure of classification error.



This week's assignment gave us the opportunity to put four of those methods to use. Below is a map of the distribution of senior citizens in Miami-Dade County, Florida. As the data displayed utilizes percentages, it has been normalized using the total population. The methods used are Natural Breaks (Jenks), Quantile, Equal Interval and Standard Deviation. 

We were also required to replicate the map utilizing senior citizen amounts normalized by land area. While normalization of data is important, it is also important to use an appropriate measure. Using an inappropriate measure can manipulate the appearance of the data and create a false narrative.  


Comments

Popular posts from this blog

GIS 5935 - Surfaces - TINs and DEMs

GIS 5935 - Data Quality Assessment

GIS 5100 - Coastal Flooding