update help text for top-level super command #5426

Merged: mccanne merged 2 commits into main from super-help on Nov 3, 2024

Conversation

mccanne
Collaborator

@mccanne mccanne commented Nov 3, 2024

No description provided.

@mccanne mccanne requested review from philrz and a team November 3, 2024 03:00
HiddenFlags: "cpuprofile,memprofile,pathregexp",
Long: `
XXX "super" is a command-line tool for processing data in diverse input formats,
The "super" command provides a way to process data in diverse input formats,
providing search, analytics, and extensive transormations using
Contributor

Suggested change
providing search, analytics, and extensive transormations using
providing search, analytics, and extensive transformations using

A query typically applies Boolean logic or keyword search to filter
the input and then transforms or analyzes the filtered stream.
Output is written to one or more files or to standard output.

A Zed query is comprised of one or more operators interconnected
A query is comprised of one or more operators interconnected
into a pipeline using the Unix pipe character "|".
Contributor

Suggested change
into a pipeline using the Unix pipe character "|".
into a pipeline using the Unix pipe character "|>".

Want to aspirationally make this change now on the assumption #5427 will go through?

BTW, I briefly paused on what to call |> since it's not a single character, but I see the Google paper still calls it "the pipe character '|>'", so it seems like we can just go with that (though I also think the Unix comparison is still totally worth keeping).

Collaborator Author

using the pipe symbol "|>"

A Zed query is comprised of one or more operators interconnected
A query is comprised of one or more operators interconnected
into a pipeline using the Unix pipe character "|".
See https://github.com/brimdata/super/tree/main/docs/language
Contributor

Suggested change
See https://github.com/brimdata/super/tree/main/docs/language
See https://zed.brimdata.io/docs/language

into a pipeline using the Unix pipe character "|".
See https://github.com/brimdata/super/tree/main/docs/language
for details.
for details. The "select" and "from" operators provide backward
compatible with SQL. In fact, you can use SQL exclusively and
Contributor

Suggested change
compatible with SQL. In fact, you can use SQL exclusively and
compatibility with SQL. In fact, you can use SQL exclusively and


Supported input formats include CSV, JSON, NDJSON, Parquet,
Supported file formats include Arrow, CSV, JSON, Parquet,
Super JSON, Super Binary, Super Columnar, and Zeek TSV.
Contributor

Suggested change
Super JSON, Super Binary, Super Columnar, and Zeek TSV.
Super JSON, Super Binary, Super Columnar, and Zeek TSV.

For most types of data, the input format is automatically detected.
If multiple files are specified, each file format is determined independently
so you can mix and match input types. If multiple files are concatenated
into a stream and presented as standard input, the files must all be of the
same type as the beginning of stream will determine the format.

If no input file is specified, the default of a single null value will be
Contributor

Suggested change
If no input file is specified, the default of a single null value will be
If no input file is specified, the default of a single null input value will be

If no input file is specified, the default of a single null value will be
fed to the query. This is analogous to SQL's default input of a single
Contributor

Suggested change
fed to the query. This is analogous to SQL's default input of a single
fed to the query. This is analogous to SQL's default input of a single

convenient when a large, complex query spans multiple lines. In this case,
these source files are concatenated together along with the command-line query text
in the order appearing on the command line.
in the order appearing on the command line. Any error messages are properly
collated to the included file in which it occurred.
Contributor

Suggested change
collated to the included file in which it occurred.
collated to the included file in which they occurred.

"zq" must be run with at least one input. Input files can
be file system paths; "-" for standard input; or HTTP, HTTPS, or S3 URLs.
Input files may be file system paths;
"-" for standard input; or HTTP, HTTPS, or S3 URLs.
For most types of data, the input format is automatically detected.
If multiple files are specified, each file format is determined independently
so you can mix and match input types. If multiple files are concatenated
into a stream and presented as standard input, the files must all be of the
same type as the beginning of stream will determine the format.
Contributor

Suggested change
same type as the beginning of stream will determine the format.
same type as the beginning of the stream will determine the format.

@mccanne mccanne merged commit 71ed2c2 into main Nov 3, 2024
3 checks passed
@mccanne mccanne deleted the super-help branch November 3, 2024 20:45
2 participants