Learn how to Work Smarter

Wednesday, December 9, 2009

Using strict keys

I finally modified the relationship between wells and water levels today; this was the troubling design:


...corrected to this:



Of course this made more trouble, adding and then correlating water levels with the new well screen identifiers (I just used "001" to replace all the Null values in ScreenIntervalID).  The next issue was a bit embarrassing -- my nice, clean data in tblGroundwaterLevel ... not so clean.  Found a number of violations, nulls, and weird missing data in this table.  Not surprising when accumulating data from a dozen different sources for one analytical tool, I have to constantly remind myself.

Enter the index.  This is a way of enforcing rules between records of data.  Check out UniqueKey; it's telling the tblGroundwaterLevel that records cannot repeat these four fields:
  • LocationName
  • ScreenIntervalID
  • LoggingDate
  • MeasuringMethod
Note that LoggingDate is a date/time field (as usual in Access/most DBs), so, if there are transducer readings at 8 am, 12, and 4 pm, each LoggingDate record is unique ("mm/dd/yyyy 08:00", "mm/dd/yyyy 12:00", and "mm/dd/yyyy 16:00").

Additionally, including "MeasuringMethod" guards against a key violation if there's a manual measurement at exactly the same moment as a transducer measurement is made, providing the data manager specifies different methods for the different data.


Keys guard against:
  • duplicate records
    • ex: importing from two different Excel files with overlap; or
    • importing the same transducer file more than once.
  • incomplete records
    • blank LoggingDates (this was a problem bringing in data from a northwest-based regulator)
    • blank levels
You'd think including StaticWaterDepth would be a good idea, but this introduces two problems:
  1. What to do when the DryWellFlag is true -- include a sounding for StaticWaterDepth?  Fake a value?
  2. We're guarding against duplicate WellID (LocationName/ScreenIntervalID) entries for the same date -- another issue I've seen importing data from a variety of sources is conflicting info for the same date/well -- so you want to catch imports like this:



(Wrinkle: while I could have -- probably should have -- made this the PrimaryKey (ensuring no nulls or duplicates in the future, and probably better compatibility with Access 2010), calling it a UniqueKey was easier because I could ignore nulls for a moment; save the index; go fix duplicates and Null ScreenIntervalIDs; and come back to enforce the index.)


Now to implement this in our template DB.

No comments:

Post a Comment

Home | My Schedule (Free/Busy) | Professional CV | Learn how to Work Smarter

A Little More Background

Friends & Followers