Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suppress scope value owner migration events. #2195

Merged
merged 3 commits into from
Oct 24, 2024

Conversation

SpicyLemon
Copy link
Contributor

@SpicyLemon SpicyLemon commented Oct 21, 2024

Description

This PR updates the metadata module migration from v3 to v4 (to move value-owner records into the bank module). It will no longer emit any events when run on a mainnet node. Any events for other parts of the upgrade will still be included, though, just not the events from the metadata module migration.

For each scope value owner, there is at least 6 events (more if they've opted into quarantine). Both mainnet and testnet have around 300,000 scopes and most (if not all) have a value owner. That's around 1.8M events that would be emitted.

On testnet, when we ran the viridian-rc1 upgrade, it included all of those events in the block-results for the upgrade height. I had to write a custom executable to extract it, and as JSON, it was almost 400MB. We could not find a way to allow a node to actually return that info via the block-results query.

In some unit tests I wrote and played with, reading the block result from state and unmarshalling it always took less than half a second. But that's a small fraction of the amount of time between request and timeout. My theory is that some limit is being hit, causing a panic in a sub-process that isn't being properly handled. When the panic happens, the request processing is halted and the node never sends any sort of response, leading ultimately to a timeout from something. Further, I believe that when that panic happens, there's memory that has been allocated, but not released, leading to a memory leak. This might explain why some modes had memory problems (some causing system crashes) after the upgrade.

So, to be on the safe side, we'll omit the events from the metadata module migration.

Also, so that v1.20.0-rc2 and v1.20.0 can be state compatible, this change only affects non-testnet nodes. That way, if you're recreating a node, playing through all the blocks, then, on the viridian-rc1 upgrade height, you can switch to v1.20.0 and get the correct result. I.e. for a testnet node, all the events will still be emitted during the metadata v3 to v4 module migration.


Before we can merge this PR, please make sure that all the following items have been
checked off. If any of the checklist items are not applicable, please leave them but
write a little note why.

  • Targeted PR against correct branch (see CONTRIBUTING.md).
  • Linked to Github issue with discussion and accepted design OR link to spec that describes this work.
  • Wrote unit and integration tests
  • Updated relevant documentation (docs/) or specification (x/<module>/spec/).
  • Added relevant godoc comments.
  • Added relevant changelog entries under .changelog/unreleased (see Adding Changes).
  • Re-reviewed Files changed in the Github PR explorer.
  • Review Codecov Report in the comment section below once CI passes.

Summary by CodeRabbit

  • New Features

    • Introduced a no-operation event manager to suppress event emissions during metadata migration, enhancing performance.
    • Conditional event suppression based on the environment (testnet vs. others) during metadata migration.
  • Bug Fixes

    • Improved error handling and logging during the migration of metadata from version 3 to 4.
  • Documentation

    • Updated migration logic documentation to reflect changes in event management and value owner handling.

@SpicyLemon SpicyLemon marked this pull request as ready for review October 21, 2024 22:58
@SpicyLemon SpicyLemon requested a review from a team as a code owner October 21, 2024 22:58
Copy link
Contributor

coderabbitai bot commented Oct 21, 2024

Walkthrough

This pull request introduces changes to suppress event emissions during the metadata migration process and implements a no-operation event manager. The Migrate3To4 function in the keeper package is modified to utilize this no-op event manager unless the context indicates a testnet environment. Additionally, new interfaces and structs are defined to facilitate the migration process, ensuring proper handling of value owner data. The changes focus on enhancing the migration logic while maintaining the overall structure of the migration process.

Changes

File Path Change Summary
.changelog/unreleased/improvements/2195-hide-md-mig-events.md Documented the suppression of events during metadata migration.
internal/sdk/events.go Introduced NoOpEventManager struct implementing sdk.EventManagerI with various no-op methods.
x/metadata/keeper/migrations_v4.go Modified Migrate3To4 to use a no-op event manager; added keeper3To4I interface and struct.

Possibly related PRs

Suggested reviewers

  • Taztingo
  • scirner22

Poem

🐇 In the meadow where events once danced,
A hush now falls, as silence pranced.
With no-op magic, we softly tread,
Migrating data, where once it led.
A hop, a skip, through versions we roam,
In quietude, we find our home. 🌼


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (2)
internal/sdk/events.go (2)

1-8: Consider renaming the package to avoid confusion with imports.

The package name sdk might lead to confusion as it's the same as one of the imported packages. Consider renaming it to something more specific, like noopevents or eventssuppression, to clearly indicate its purpose and avoid potential naming conflicts.


1-46: Overall implementation aligns well with PR objectives.

The NoOpEventManager implementation successfully addresses the PR objective of suppressing event emissions during the metadata migration process. Key points:

  1. All methods of the sdk.EventManagerI interface are correctly implemented to do nothing or return empty results.
  2. The code is well-structured, with clear comments explaining the purpose and behavior of each component.
  3. The implementation will effectively prevent the generation of a large number of events during migration, addressing the performance issues and potential memory leaks mentioned in the PR objectives.

This no-op event manager can be selectively used for non-testnet nodes, allowing for the suppression of events on mainnet while still emitting events on testnet, as specified in the PR objectives.

Consider adding a configuration option or environment variable to enable/disable this no-op event manager, allowing for more flexible control over event suppression in different environments or scenarios.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between 0bf059c and 476152a.

📒 Files selected for processing (3)
  • .changelog/unreleased/improvements/2195-hide-md-mig-events.md (1 hunks)
  • internal/sdk/events.go (1 hunks)
  • x/metadata/keeper/migrations_v4.go (1 hunks)
✅ Files skipped from review due to trivial changes (1)
  • .changelog/unreleased/improvements/2195-hide-md-mig-events.md
🧰 Additional context used
📓 Learnings (1)
x/metadata/keeper/migrations_v4.go (2)
Learnt from: SpicyLemon
PR: provenance-io/provenance#2140
File: x/metadata/keeper/migrations_v4.go:57-80
Timestamp: 2024-10-02T02:06:21.664Z
Learning: The `V3WriteNewScope` method in `x/metadata/keeper/migrations_v4.go` must exist until after the migration has been executed and will be deleted as part of the `viridian` cleanup.
Learnt from: SpicyLemon
PR: provenance-io/provenance#2140
File: x/metadata/keeper/migrations_v4.go:57-80
Timestamp: 2024-10-08T18:12:51.935Z
Learning: The `V3WriteNewScope` method in `x/metadata/keeper/migrations_v4.go` must exist until after the migration has been executed and will be deleted as part of the `viridian` cleanup.
🔇 Additional comments (10)
internal/sdk/events.go (6)

10-13: LGTM: NoOpEventManager struct and interface assertion.

The NoOpEventManager struct is correctly declared as an empty struct, and the interface assertion ensures it implements sdk.EventManagerI. The comment clearly explains the purpose of the struct.


15-18: LGTM: NewNoOpEventManager function.

The NewNoOpEventManager function is correctly implemented, creating and returning a new NoOpEventManager instance. The comment clearly explains the purpose of the function.


20-30: LGTM: Events and ABCIEvents methods.

Both Events and ABCIEvents methods are correctly implemented, returning empty events instead of nil to match sdk.EventManager behavior. This approach ensures consistency and prevents potential nil pointer issues. The comments clearly explain the reasoning behind the implementation.


32-40: LGTM: EmitTypedEvent and EmitTypedEvents methods.

Both EmitTypedEvent and EmitTypedEvents methods are correctly implemented for a no-op event manager. They ignore their arguments and always return nil, which is the expected behavior. The comments clearly explain this behavior.


42-43: LGTM: EmitEvent method.

The EmitEvent method is correctly implemented for a no-op event manager. It ignores its argument and does nothing, which is the expected behavior. The comment clearly explains this behavior.


45-46: LGTM: EmitEvents method.

The EmitEvents method is correctly implemented for a no-op event manager. It ignores its argument and does nothing, which is the expected behavior. The comment clearly explains this behavior.

x/metadata/keeper/migrations_v4.go (4)

13-13: Confirm usage of internal package import

The code imports internalsdk from an internal package:

internalsdk "github.com/provenance-io/provenance/internal/sdk"

Importing from an internal package is generally intended for use only within that package and may lead to visibility or compatibility issues. Please verify that importing internalsdk here is acceptable and will not cause future maintainability problems.


Line range hint 57-79: Ensure retention of V3WriteNewScope for migration purposes

The V3WriteNewScope function is marked as deprecated and intended for removal post-migration:

// Deprecated: Only exists to facilitate testing of the migration of the metadata module from v3 to v4.
func (k Keeper) V3WriteNewScope(ctx sdk.Context, scope types.Scope) error {
    // function body...
}

According to the retrieved learnings, this method must remain until after the migration is executed and will be deleted during the viridian cleanup. Please ensure that this function is retained until it is safe to remove.


28-28: ⚠️ Potential issue

Assess security implications of bypassing marker restrictions

The migration bypasses marker send restrictions with:

ctx = markertypes.WithBypass(ctx)

While this is necessary to facilitate the migration, please verify that:

  • Scope Limitation: The bypass is strictly limited to the migration process and does not persist beyond it.
  • Security Measures: Adequate safeguards are in place to prevent unauthorized access or actions during the bypass.
  • Restoration of Context: Normal marker restrictions are reinstated after the migration to maintain system integrity.

20-26: ⚠️ Potential issue

Review event suppression logic during migration

The code conditionally suppresses event emissions during migration on non-testnet environments:

// Lines 24-26
if sdk.GetConfig().GetBech32AccountAddrPrefix() != "tp" {
    ctx = ctx.WithEventManager(internalsdk.NewNoOpEventManager())
}

While this approach addresses performance issues caused by excessive events, please ensure that:

  • Event Dependencies: Suppressing events does not negatively impact any modules or processes that rely on these events during migration.
  • Network Identification: The check for the Bech32 account address prefix accurately identifies the testnet environment. If other environments share the "tp" prefix or if there are additional prefixes in use, consider refining the condition.
  • Future Compatibility: This logic remains valid for future testnets or network changes.

You can use the following script to verify that no critical dependencies on these events exist:

✅ Verification successful

Event Suppression Logic Verified Successfully

The event suppression during migration does not impact other modules or processes.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Verify if any modules depend on migration events.

# Search for any handlers or listeners that might be affected.
rg 'AddEventListener|HandleEvent|OnEvent' -A 5

# Check for any usage of migration events in the codebase.
rg 'Migrate3To4' -A 20 | rg 'EmitEvent|EventManager'

Length of output: 200

@SpicyLemon SpicyLemon enabled auto-merge (squash) October 22, 2024 16:30
@SpicyLemon SpicyLemon added this to the v1.20.0 milestone Oct 23, 2024
@SpicyLemon SpicyLemon merged commit 2d557cb into main Oct 24, 2024
35 checks passed
@SpicyLemon SpicyLemon deleted the dwedul/hide-md-mig-events branch October 24, 2024 13:12
SpicyLemon added a commit that referenced this pull request Oct 24, 2024
* Create a no-op event manager and use that during the metadata module migration.

* Do not suppress the events for a testnet upgrade since they were emitted when the migration ran on testnet.

* Add changelog entry.
SpicyLemon added a commit that referenced this pull request Oct 24, 2024
…tion events), #2196 (timeout_commit), #2197 (recordspec cmd), #2198 (ParameterChangeProposal) #2199 (wasm build-address cmd). (#2200)

* Suppress scope value owner migration events. (#2195)

* Create a no-op event manager and use that during the metadata module migration.

* Do not suppress the events for a testnet upgrade since they were emitted when the migration ran on testnet.

* Add changelog entry.

* Update all the spec proto links to reference v1.20.0 (#2192)

* Update all the spec proto links to reference v1.20.0 (instead of 1.19.0).

* Add changelog entry.

* When prepping a release, combine the dependency bump changelog entries. (#2181)

* Add a note to get-dep-changes to alert folks that changing those formats might break other things.

* Create an awk script that will combine dependency changelog entries. Update prep-release to use it. Also apply a couple fixes that are alread in the release branch (and will be in main shortly). Also tweak the step 4 and 5 names to provide more context, and fix the verbose output header when recombining the sections.

* Add changelog entry.

* Clarify the new comment in get-dep-changes.sh.

* Update stuff that uses or talks about RELEASE_NOTES.md because it should actually be RELEASE_CHANGELOG.md. The SDK uses _NOTES but only puts a blurb in there, so it's not a changelog. But we include a changelog, so it makes sense to keep it named that way.

* Fix the `query metadata recordspec` command when given a rec-spec-id. (#2197)

* [2148]: Fix the query metadata recordspec command to correctly use the RecordSpecification query (instead of RecordSpecificationsForContractSpecification) when provided a record specification id.

* [2148]: Add changelog entry.

* Fix decoding of gov props with a ParameterChangeProposal in them. (#2198)

* Write a unit test that fails to parse a gov proposal with a ParameterChangeProposal in it because that type isn't being registered anymore.

* Register the params module stuff with the codecs since there's some gov props with a ParameterChangeProposal in them.

* Add changelog entry.

* Hard-code consensus.timeout_commit to 3.5s for mainnet. (#2196)

* [2121]: Change the default consensus timeout value to 3.5 seconds.

* [2121]: Hard-code the consensus.timeout_commit value.

* [2121]: Fix TestIsTestnetFlagSet to not be affected by existing env vars.

* [2121]: Fix a couple unit tests that broke when I changed the default commit timout.

* [2121]: Only hard-code the timeout commit on non-testnets.

* [2121]: Change the default back to 1.5s for faster default testnets.

* [2121]: Fix the TestPreUpgradeCmd that broke because of the hard-coded timeout commit.

* [2121]: Add some unit tests that make sure the consensus timeout commit value is behaving as expected.

* [2121]: Add changelog entry.

* [2121]: When forcing the timeout_commit to be 3.5 seconds, also force the skip flag to be false.

* [2121]: Update warnAboutSettings: Evaluate the timeout commit and skip-timeout-commit fields separately. Issue a warning if skip-timeout-commit is true. Issue a warning if the timeout commit is not exactly what we want it to be.

* Fix: Add node flag to WASM queries (build-address) (#2199)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Development

Successfully merging this pull request may close these issues.

3 participants