+
+
+ README
+
+
+
+ opossum
+
+
+
+
+Opossum is a Node.js circuit breaker that executes asynchronous functions
+and monitors their execution status. When things start failing, opossum
+plays dead and fails fast. If you want, you can provide a fallback function
+to be executed when in the failure state.
+For more about the circuit breaker pattern, there are lots of resources
+on the web - search it! Fowler's blog post is one place to
+start reading.
+
+Usage
+Let's say you've got an API that depends on something that might fail -
+a network operation, or disk read, for example. Wrap those functions up in a
+CircuitBreaker
and you have control over your destiny.
+AbortController support
+You can provide an AbortController
(https://developer.mozilla.org/en-US/docs/Web/API/AbortController, https://nodejs.org/docs/latest/api/globals.html#globals_class_abortcontroller) for aborting on going request upon
+reaching Opossum timeout.
+Fallback
+You can also provide a fallback function that will be executed in the
+event of failure. To take some action when the fallback is performed,
+listen for the fallback
event.
+Once the circuit has opened, a timeout is set based on options.resetTimeout
.
+When the resetTimeout
expires, opossum
will enter the halfOpen
state.
+Once in the halfOpen
state, the next time the circuit is fired, the circuit's
+action will be executed again. If successful, the circuit will close and emit
+the close
event. If the action fails or times out, it immediately re-enters
+the open
state.
+When a fallback function is triggered, it's considered a failure, and the
+fallback function will continue to be executed until the breaker is closed.
+The fallback function accepts the same parameters as the fire function:
+Breaker State Initialization
+There may be times where you will need to initialize the state of a Circuit Breaker. Primary use cases for this are in a serverless environment such as Knative or AWS Lambda, or any container based platform, where the container being deployed is ephemeral.
+The toJSON
method is a helper function to get the current state and status of a breaker:
+const breakerState = breaker.toJSON();
+
+This will return an object that might look similar to this:
+{
+ state: {
+ enabled: true,
+ name: 'functionName'
+ closed: true,
+ open: false,
+ halfOpen: false,
+ warmUp: false,
+ shutdown: false
+ },
+ status: {
+ ...
+ }
+};
+
+A new circuit breaker instance can be created with this state by passing this object in:
+const breaker = new CircuitBreaker({state: state});
+
+Status Initialization
+There may also be times where you will need to pre-populate the stats of the Circuit Breaker Status Object. Primary use cases for this are also in a serverless environment such as Knative or AWS Lambda, or any container based platform, where the container being deployed is ephemeral.
+Getting the existing cumalative stats for a breaker can be done like this:
+const stats = breaker.stats;
+
+stats
will be an object that might look similar to this:
+{
+ failures: 11,
+ fallbacks: 0,
+ successes: 5,
+ rejects: 0,
+ fires: 16,
+ timeouts: 0,
+ cacheHits: 0,
+ cacheMisses: 0,
+ semaphoreRejections: 0,
+ percentiles: {
+ '0': 0,
+ '1': 0,
+ '0.25': 0,
+ '0.5': 0,
+ '0.75': 0,
+ '0.9': 0,
+ '0.95': 0,
+ '0.99': 0,
+ '0.995': 0
+ },
+ latencyTimes: [ 0 ],
+ latencyMean: 0
+}
+
+To then re-import those stats, first create a new Status
object with the previous stats and then pass that as an option to the CircuitBreaker constructor:
+const statusOptions = {
+ stats: {....}
+};
+
+const newStatus = CircuitBreaker.newStatus(statusOptions);
+
+const breaker = new CircuitBreaker({status: newStatus});
+
+Browser
+Opossum really shines in a browser. You can use it to guard against network
+failures in your AJAX calls.
+We recommend using webpack to bundle your applications,
+since it does not have the effect of polluting the window
object with a global.
+However, if you need it, you can access a circuitBreaker
function in the global
+namespace by doing something similar to what is shown in the below example.
+Here is an example using hapi.js. See the
+opossum-examples
+repository for more detail.
+Include opossum.js
in your HTML file.
+In your application, set a route to the file, pointing to
+node_modules/opossum/dist/opossum-min.js
.
+In the browser's global scope will be a CircuitBreaker
constructor. Use it
+to create circuit breakers, guarding against network failures in your REST
+API calls.
+Events
+A CircuitBreaker
will emit events for important things that occur.
+Here are the events you can listen for.
+
+fire
- emitted when the breaker is fired.
+reject
- emitted when the breaker is open (or halfOpen).
+timeout
- emitted when the breaker action times out.
+success
- emitted when the breaker action completes successfully
+failure
- emitted when the breaker action fails, called with the error
+open
- emitted when the breaker state changes to open
+close
- emitted when the breaker state changes to closed
+halfOpen
- emitted when the breaker state changes to halfOpen
+fallback
- emitted when the breaker has a fallback function and executes it
+semaphoreLocked
- emitted when the breaker is at capacity and cannot execute the request
+healthCheckFailed
- emitted when a user-supplied health check function returns a rejected promise
+shutdown
- emitted when the breaker shuts down
+
+Handling events gives a greater level of control over your application behavior.
+Promises vs. Callbacks
+The opossum
API returns a Promise
from CircuitBreaker.fire()
.
+But your circuit action - the async function that might fail -
+doesn't have to return a promise. You can easily turn Node.js style
+callback functions into something opossum
understands by using the built in
+Node core utility function util.promisify()
.
+And just for fun, your circuit doesn't even really have to be a function.
+Not sure when you'd use this - but you could if you wanted to.
+Calculating errorThresholdPercentage
+The errorThresholdPercentage
value is compared to the error rate. That rate is determined by dividing the number of failures by the number of times the circuit has been fired. You can see this comparison here:
+The numbers for fires
and failures
come from the stats that are indeed governed by rollingCountTimeout
and rollingCountBuckets
. The timeout value is the total number of seconds for which the stats are being maintained, and the buckets value is the number of slots in the window. The defaults are 10 seconds and 10 buckets. So, the statistics that are being compared against errorThresholdPercentage
are based on 10 samples, one per second over the last 10 seconds.
+Example: a circuit is fired 24 times over 10 seconds with a somewhat bursty pattern, failing three times.
+| fires: 2 | fires: 1 | fires: 3 | fires: 0 | fires: 9 | fires: 3 | fires: 2 | fires: 0 | fires: 4 | fires: 0 |
+| fails: 0 | fails: 0 | fails: 0 | fails: 0 | fails: 0 | fails: 3 | fails: 0 | fails: 0 | fails: 0 | fails: 0 |
+
+The failure rate here is 3/24 or 1/8 or 12.5%. The default error threshold is 50%, so in this case, the circuit would not open. However, if you modified the rollingCountTimeout
to 3 seconds, and the rollingCountBuckets
to 3 (not recommended), then the stats array might look like these three seconds from above.
+| fires: 3 | fires: 2 | fires: 0 |
+| fails: 3 | fails: 0 | fails: 0 |
+
+Now, without changing errorThresholdPercentage
our circuit will open because our error rate is now 3/5 or 60%. It's tricky to test this stuff because the array of statistics is a rolling count. Every second the oldest bucket is removed and a new one is added, so the totals change constantly in a way that may not be intuitive.
+For example, if the first example is shifted right, dropping the first bucket and adding another with fires: 3
the total number of fires
now in the stats is not 27 (24+3) but 25 (24-2+3).
+The code that is summing the stats samples is here:
+Typings
+Typings are available here.
+If you'd like to add them, run npm install @types/opossum
in your project.
+Metrics
+Prometheus
+The opossum-prometheus
module
+can be used to produce metrics that are consumable by Prometheus.
+These metrics include information about the circuit itself, for example how many
+times it has opened, as well as general Node.js statistics, for example event loop lag.
+Hystrix
+The opossum-hystrix
module can
+be used to produce metrics that are consumable by the Hystrix Dashboard.
+Troubleshooting
+You may run into issues related to too many listeners on an EventEmitter
like this.
+In some cases, seeing this error might indicate a bug in client code, where many CircuitBreaker
s are inadvertently being created. But there are legitimate scenarios where this may not be the case. For example, it could just be that you need more than 10 CircuitBreaker
s in your app. That's ok.
+To get around the error, you can set the number of listeners on the stream.
+Or it could be that you have a large test suite which exercises some code that creates CircuitBreaker
s and does so repeatedly. If the CircuitBreaker
being created is only needed for the duration of the test, use breaker.shutdown()
when the circuit is no longer in use to clean up all listeners.
+
+
+