Skip to content

Latest commit

 

History

History
219 lines (149 loc) · 10.9 KB

DMIS.md

File metadata and controls

219 lines (149 loc) · 10.9 KB

Overview

The Data Matching and Imputation System has been replaced by CAMS. It was constructed and maintained by GARFO's APSD. It contains finalized data through April 30, 2023, with a last data update from August 9, 2023.

"Live" Views to DMIS:

  • NEFSC_GARFO.APSD_t_ssb_trip_current
  • NEFSC_GARFO.APSD_t_ssb_catch_current
  • NEFSC_GARFO.APSD_t_ssb_discard_current

obsolete tables:

  • APSD.t_ssb_trip@garfo_nefsc
  • APSD.t_ssb_catch@garfo_nefsc
  • APSD.t_ssb_discard@garfo_nefsc

Location: NEFSCDB1

Schema: NEFSC_GARFO

DMIS and AA are expected to be replaced by CAMS.

Current Collection Methods

GARFO provided metadata

Here are a few documents that describe DMIS.

  1. DMIS Data Dictionary. Circulated by Tammy Murphy on April 27, 2020.
  2. DMIS Data Dictionary part 2. Circulated by Tammy Murphy on May 1, 2020. I'm not sure how this differs from the previous.
  3. DMIS Documentation. This appears to be from J. Michael Lanning and last modified on September 26, 2019.
  4. DMIS and AA comparison. J. Michael Lanning's Presentation that describes AA and DMIS, given to NRCC on May 15,2018.
  5. DMIS Binning Rules for the "fishery_group" column.

Ben Galuardi has built a custom DMIS table that combines the surfclam data with the DMIS data. It runs from 2007 to present. The code used to generate it is here and here. This DMIS table can be queried by:

select * from APSD.DMIS_WIND_TEST@garfo_nefsc

This table was constructed specifically to support the offshoreWind. It only has a subset of columns, but it retains linking variables additional data can easily be added to it.

Changes to collection methods

Tips and Tricks

General Caveats

Linking to Veslog

Linking to VESLOG with DMIS DOCID has a few issues. (Chad Demarest, May 14, 2020)

  1. JML adds digits to DMIS DOCID to denote subtrips.
  2. Another is that some data handling protocols (varies by file type) truncate a digit off the end of the EVTR serial numbers.
  3. A third is that DMIS will Give positive (mostly correct) match’s where the DOCID fails for these and other reasons.

As of Feb 2022, I'm not sure if (1) is still true:

select docid_length,count(*) from(
select length(to_char(docid)) as docid_length from NEFSC_GARFO.APSD_t_ssb_trip_current where docid is not null) group by docid_length;

because the docids are all either 7 (paper) or 14 (electronic) digits

For (2): you can use the to_char() function in oracle sql to force oracle to extract this column as a string. You can leave it as a string or use your statistical software to convert the string back to a number.

On Home Consumption Fish

The code for this stuff is BHC_ (either _LIVE_POUNDS or LANDED_POUNDS). "BHC" stands for "Bait and Home Consumption". A few years ago they added LUMF (Legal UnMarketable Fish) to this category as well. LUMF, at lease for here, and at least as I understand it, is derived from observer trips only. But I'm not 100% sure on that. If you use DLR_LIVE or DLR_LANDED (or DLR_DOLLAR) you won't get the BHC fish. If you use LANDED or POUNDS or DOLLAR_SSB/GDP you'll get 'em, plus imputed values for the DOLLAR field. Starting on the next run of DMIS, fish that are authorized to be landed on EFP trips but are not sold thu a dealer will be added to the BHC fish. Chad's email April 11, 2018.

Permits

DMIS has a set of True/False variables corresponding to permit categories. It is missing some: for example, there is no column for the Herring category E permit.

Versions

There is a "_current" versioning of tables. As of April 2020, current contains a little bit more data. I now use _trip for everything, whereas a yearly ago I used _current. There were some discrepancies for earlier years (2013?) that _trip had correct and _trip_current did not. Otherwise I believe they are the same. (Chad)

I have been using current since there is more recent data included. Hopefully for older data the two match up, but I haven't looked into that in a while. (Greg)

*UPDATE FROM Sept. 30, 2021: APSD.t_ssb_trip@garfo_nefsc and APSD.t_ssb_catch@garfo_nefsc apparently are no longer in use. Recommend to use the _CURRENT tables. Below is the _CURRENT tables values minus the "NON-CURRENT" values for the full DMIS time period. If pulling exclusively before 2019, then values are similar.

Calenday_Year Total_Observations dlr_dollar dollar_ssb
2007 0 0 0
2008 0 0 0
2009 384 -478 17,367
2010 -32 50,712 269,606
2011 219 -3,525 14,669
2012 -23 2,420 58,671
2013 0 0 0
2014 0 0 0
2015 0 0 0
2016 0 0 0
2017 0 0 0
2018 0 0 0
2019 52,595 42,412,031 41,091,647
2020 618,861 923,960,142 956,768,538
2021 468,514 1,030,426,010 1,077,516,423

Prices

Here is the algorithm for doing prices in DMIS circulated on March 7, 2021.

======  what you sent ====== 
The imputation is in the following order: 
Day / (NESPP3 , MARKET GRADE) / Port                      :     Code:   D4P 
Day / (NESPP3 , MARKET GRADE) / County                 :     Code:   D4C 
Day / (NESPP3 , MARKET GRADE) / State                    :     Code:   D4S 
Day / (NESPP3 , MARKET GRADE) / Region                 :     Code:   D4P 
Day / (NESPP3 , MARKET GRADE) / North East           :     Code:   D4 
Week/ (NESPP3 , MARKET GRADE) / North East         :     Code:   W4 
Month / (NESPP3 , MARKET GRADE) / North East        :     Code:   M4 

Then 

Day / (NESPP3 ) / Por                      :     Code:   D3P 
Day / (NESPP3) / County                 :     Code:   D3C 
Day / (NESPP3) / State                    :     Code:   D3S 
Day / (NESPP3) / Region                 :     Code:   D3P 
Day / (NESPP3) / North East           :     Code:   D3 
Week/ (NESPP3) / North East         :     Code:   W3 
Month / (NESPP3) / North East        :     Code:   M3 
================== 

MODIFICATION: 

The imputation is in the following order: 

Day / (NESPP3 , MARKET GRADE) / Port                      :     Code:   D4P 
Day / (NESPP3 , MARKET GRADE) / County                 :     Code:   D4C 
Day / (NESPP3 , MARKET GRADE) / State                    :     Code:   D4S 
Day / (NESPP3 , MARKET GRADE) / Region                 :     Code:   D4P  (D4R??) 
Day / (NESPP3 , MARKET GRADE) / North East           :     Code:   D4 

Then FOR NON-MONKFISH: 

Day / (NESPP3 ) / Port                      :     Code:   D3P 
Day / (NESPP3) / County                 :     Code:   D3C 
Day / (NESPP3) / State                    :     Code:   D3S 
Day / (NESPP3) / Region                 :     Code:   D3P (D3R??) 
Day / (NESPP3) / North East           :     Code:   D3 

Week/ (NESPP3) / North East         :     Code:   W3 
Month / (NESPP3) / North East        :     Code:   M3 

THEN FOR MONKFISH ONLY: 

Week / (NESPP3 , MARKET GRADE) / Port
Week / (NESPP3 , MARKET GRADE) / County
Week / (NESPP3 , MARKET GRADE) / State
Week / (NESPP3 , MARKET GRADE) / Region
Week/ (NESPP3 , MARKET GRADE) / North East      


Month / (NESPP3 , MARKET GRADE) / Port
Month / (NESPP3 , MARKET GRADE) / County
Month / (NESPP3 , MARKET GRADE) / State
Month / (NESPP3 , MARKET GRADE) / Region
Month / (NESPP3 , MARKET GRADE) / North East
       
Week/ (NESPP3) / North East
Month / (NESPP3) / North East     

Price Variables

There are two different price variables within DMIS.

DLR_DOLLAR: The value of landed catch to the nearest dollar, paid to fishermen by the dealer, for a given species. All value from for this variable comes directly from dealer reports.

DOLLAR_SSB: In addition to the values represented in the DLR_DOLLAR variable, DOLLAR_SSB imputs values according to the above code. Imputation is performed in the cases of bait and home consumption (BHC) or if there is a VTR record, but no matching dealer report. *Greg has some correspondence from Dan Caless that he can forward to anyone interested.

Completeness of Fishery Revenue data

For many fisheries in the Greater Atlantic Region, DMIS (GARFO_NEFSC.APSD_t_ssb_catch_current) includes the vast majority of ex-vessel revenues as compared to CFDBS. However, in fisheries which have large state waters components, or unqiue reporting requirements, DMIS may fail to capture a significant portion of revenues. A comparison of fishery ex-vessel revenues in 2018 for the two data sources is provided below. Note lobster and SC/OQ.

Schema Black Sea Bass Bluefish Groundfish Herring Jonah Crab Lobster Mackerel Monkfish Red Crab
DMIS Query $9,488,729 $956,188 $47,767,489 $23,096,764 $18,313,781 $459,191,551 $4,276,065 $14,782,472 $3,594,397
Dealer Query $12,060,017 $2,073,282 $48,352,305 $23,150,685 $18,574,993 $631,134,282 $4,348,775 $14,937,020 $3,594,397
Schema Scup Sea Scallop Skate Spiny Dogfish Squid Summer Flounder Surf Clam / Ocean Quahog Tilefish Whiting
DMIS Query $8,537,625 $524,952,412 $7,042,994 $2,627,879 $61,797,190 $22,780,233 $3,023,109 $5,098,011 $9,967,509
Dealer Query $9,708,960 $532,086,840 $7,437,288 $2,832,603 $62,300,665 $25,395,970 $61,321,459 $5,152,426 $10,040,221

Sample Code

  1. Code to match observer data to DMIS
  2. This will get you groundfish trips every time:
select XYZ
from NEFSC_GARFO.APSD_t_ssb_trip_current T, NEFSC_GARFO.APSD_t_ssb_catch_current C
where C.trip_id=T.trip_id
and C.fishery_group in ('GROUND', 'OTHER2')
and T.groundfish_permit is not null
and T.sector_id <> 2
and C.secgearfish <> 'CAR'

Update Frequency and Completeness

Other Metadata sources

  • Preceded by:
  • Succeeded by: CAMS (tbd)

Related Tables

Support Tables