vdat_csv_format #7

benjaminhlina · 2024-01-19T15:27:44Z

Hi Mike,

I'm finally out of a big writing phase and will be doing more data stuff in the coming weeks. I've made a small function that removes the top rows of the csv vdat_to_csv() creates and then assigns the correct column names given the selected event field. You can also make it return the top rows giving you the event field data frame. Let me know what you think/if this is of interest. Should note that right now I have it hardcoded for the number of rows to select (not ideal, I know)...not sure if that will be appropriate given different .vrl files depending on the receiver (I did this based of .vrl files from VR2W).

I didn't add this functionality/not sure how light you want the package but could make this function use {data.table} to increase speed and could have it wrap around fread() or read.csv() ect. so that the user doesn't have to tell either of those functions to not read in the header. Might be something to consider if detection logs are big. I played with the function using a small csv.

If wanting to keep light we could shunt this to a supplementary package but not sure if that is really necessary...it really depends on your vision with the package.

Cheers,
Ben

…into vdat-format

mhpob · 2024-01-20T14:16:55Z

Thanks for taking a stab at a new idea!

I think this would be a good fit for a function that vdat_to_csv could wrap around via an argument as well as standing alone. It could keep vdat_to_csv snappy for those who just want to have vdat convert files while allowing something that "just works" for those who also want to immediately work in R. I'm pretty hesitant about drifting into the analysis realm where we need to make decisions for the user, but I don't think this does that.

From my initial read, it seems like event_field could be inferred from event_type. Does it need to be a named argument, or could we just tack on "_DESC" to the user's event_field input to move toward something that "just works"?
To get away from the hard row or column indices could it use some sort of which(grepl( call to find the splitting index of choice? Or, we could switch pre-defined indices based on the receiver type and VEMCO DATA LOG version?
If possible, it would be useful to write some tests against the different types of receivers and files to make sure everything is behavin as expected. You can run source('tests/testthat/setup-testfiles.R') to download these to your temporary directory, locations listed in the testfiles object that is created. I'm happy to develop some of these if it's asking too much.

mhpob · 2024-01-20T14:24:12Z

Oh, and with regard to data.table: I'm trying to use as few dependencies as possible. At some point in the future I may refactor to drop those we're using, but cli makes the messages so pretty!

benjaminhlina and others added 12 commits January 5, 2024 17:25

Intial commit of locate_vdat

cc87c1d

Trials

1c91dd3

Merge branch 'mhpob:main' into serch-vdat

6d43635

Style code (GHA)

7a70e5e

Intial commit of vdat_csv_format

eed65c8

Documentation of vdat_csv_format

9cbe14f

Remove locate_vdat

5ad6ff0

Updated NAMESPACE

72e1175

Removed second export, I beleive this is why it was erroring

7003952

Style code (GHA)

8ddadbd

added small example

70e71c3

Merge branch 'vdat-format' of https://github.com/benjaminhlina/rvdat …

54e5440

…into vdat-format

mhpob self-assigned this Jan 20, 2024

mhpob added the enhancement New feature or request label Jan 20, 2024

Merge branch 'mhpob:main' into vdat-format

c703a8b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vdat_csv_format #7

vdat_csv_format #7

benjaminhlina commented Jan 19, 2024

mhpob commented Jan 20, 2024 •

edited

Loading

mhpob commented Jan 20, 2024

vdat_csv_format #7

Are you sure you want to change the base?

vdat_csv_format #7

Conversation

benjaminhlina commented Jan 19, 2024

mhpob commented Jan 20, 2024 • edited Loading

mhpob commented Jan 20, 2024

mhpob commented Jan 20, 2024 •

edited

Loading