Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Diagnosis on the issue of no availability of instances in the RP_IVR Callback service #82

Closed
3 tasks done
Sachinbisht27 opened this issue Feb 12, 2024 · 2 comments
Closed
3 tasks done
Assignees
Labels
bug Something isn't working Point: 1

Comments

@Sachinbisht27
Copy link
Member

Sachinbisht27 commented Feb 12, 2024

Describe the bug
Currently, we are facing some error logs on the Monitoring Dashboard that there are a few instances when the Google Cloud function is not able to handle the sudden load and fails to scale up in certain cases.
Here is the error log:

The request was aborted because there was no available instance.
Additional troubleshooting documentation can be found at: https://cloud.google.com/functions/docs/troubleshooting#scalability

To Reproduce
Check the error logs of the Cloud function

Expected behavior/Action Items:

  • Check the root cause and come up with the required solution.
  • As the request load started increasing, need to come up with a plan for how we are going to handle this will increase in the future.

Screenshots
image

Additional context
NA

Acceptance Criteria

  • The auto-scaling should work smoothly and should handle the upcoming load

Documentation
Add whatever documentation will be required here.

@Sachinbisht27 Sachinbisht27 added the bug Something isn't working label Feb 12, 2024
@Sachinbisht27 Sachinbisht27 changed the title [BUG] [BUG] Diagnosis on the issue of no availability of instances in the RP_IVR Callback service Feb 12, 2024
@Sachinbisht27 Sachinbisht27 changed the title [BUG] Diagnosis on the issue of no availability of instances in the RP_IVR Callback service Diagnosis on the issue of no availability of instances in the RP_IVR Callback service Feb 12, 2024
@Sachinbisht27 Sachinbisht27 self-assigned this Feb 15, 2024
@Sachinbisht27
Copy link
Member Author

Sachinbisht27 commented Feb 26, 2024

Analysis of the issue

Diagnosis:
Sometimes, our existing infrastructure faced challenges in managing the application load. There were instances where we received webhooks from Kookoo, but unfortunately, our infrastructure is insufficient to capture and process them.

Current Observations:
On crossing the 10 instance count in the cloud function, we are getting the error on the monitoring dashboard and the failure for the webhook processing is occurring.

Impact:
Approx. 500-600 webhook failures in a month.

Solution:
Ways in which we can enhance the capability of current infrastructure. A way is to implement the queue-based approach.

Action Item:

@Sachinbisht27
Copy link
Member Author

Closing this issue as the diagnosis is completed and created a new issue for the Implementation of the Queue service in the RP_IVR callback. DostEducation/rp-ivr-webhook-handler#11

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Point: 1
Projects
None yet
Development

No branches or pull requests

1 participant