Add support for zones #32

Open
Copis opened this issue May 12, 2020 · 10 comments
Labels
enhancement New feature or request

Comments

@Copis

Copis commented May 12, 2020

Is your feature request related to a problem? Please describe.
We have a master zone and some satellite zones behind a VPN or firewall. In those cases the master cannot receive traps.

Describe the solution you'd like
It would be great to be able to receive these SNMP traps on one satellite endpoint and send the status to the master.

Describe alternatives you've considered
Forwarding SNMP traps from the satellite to the master.

@patrickpr patrickpr self-assigned this May 13, 2020
@patrickpr patrickpr added the enhancement New feature or request label May 13, 2020
@patrickpr
Owner

Hi,

It's a good feature; I will start working on it for the next version.

Are you able to test this? (My lab environment does not include a master/satellite setup.)

@robdevops
Contributor

I am about to do a multi-zone build and will be able to test this in the coming days/weeks.

@Copis
Author

Copis commented Jun 9, 2020

I can test this scenario in my development environment with one master and one satellite, but I think it would be better to test in an HA environment with two masters and two satellites, if possible.

@patrickpr
Owner

Update: I'm currently building the test environment for this.

@patrickpr
Owner

@Copis: the satellite architecture is a work in progress.

Test environment: two masters in HA and two satellites in HA.

Traps can be received by:

  • the master (if there is an HA master pair, using a VRRP (keepalived) IP)
  • a satellite (if there is an HA satellite pair, using VRRP too).

The satellite receives and processes traps using the configuration provided by the masters, and:

  • updates the database using a simple API provided by the trapdirector module on the masters.
  • sends passive service check results to the satellites (or to the master, this isn't decided yet).

For now, there is no zone for trap rules: they are global.

I assume:

  1. Satellites can have access to the master (and HA master) on:
  • the Icinga API port (5665 by default)
  • the Icingaweb2 HTTP port (443)
    (satellites will use a specific Icingaweb2 user)

  2. Master and HA master both have access to the trapdirector database.

  3. Latency between master(s) and satellite(s) is low (< 500 ms).

I'm open to comments and suggestions!
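
A minimal sketch of what the satellite-to-master path could look like for the passive check results, assuming the Icinga 2 API on port 5665 as above and its process-check-result action; the API URL, credentials, CA file, host and service names are placeholders, and the trapdirector database API on the masters is not shown:

```python
# Hypothetical sketch: a satellite-side trap handler pushing a passive check
# result to an Icinga 2 API endpoint (port 5665, as assumed above).
# URL, credentials, CA file and the service name are placeholders.
import requests

def send_check_result(api_url, host, service, exit_status, output,
                      auth=("trapdirector-api", "secret"),
                      ca_file="/etc/trapdirector/master-ca.pem"):
    body = {
        "type": "Service",
        "filter": 'host.name=="{}" && service.name=="{}"'.format(host, service),
        "exit_status": exit_status,   # 0=OK, 1=WARNING, 2=CRITICAL, 3=UNKNOWN
        "plugin_output": output,
    }
    try:
        r = requests.post(
            api_url + "/v1/actions/process-check-result",
            json=body,
            auth=auth,
            headers={"Accept": "application/json"},
            verify=ca_file,
            timeout=2,                # latency assumed to be low (< 500 ms)
        )
        return r.ok
    except requests.RequestException:
        return False
```

For example, send_check_result("https://master1.example.com:5665", "switch01", "snmp-traps", 2, "linkDown received") would turn a received trap into a CRITICAL state on a (hypothetical) snmp-traps service.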

@Copis
Author

Copis commented Sep 1, 2020

One of the problems that I see is that some scenarios cannot have VRRP, for example active-passive or active-active data centers (CPD) with no extended VLANs. In those cases there is no possible implementation.

@patrickpr patrickpr mentioned this issue Sep 1, 2020
@patrickpr
Owner

Opened a topic here to talk about it : https://community.icinga.com/t/trapdirector-ha-feature/5439

@p4k8
Contributor

p4k8 commented Sep 3, 2020

So here are some thoughts about it:

  1. As long as all instances of trapdirector talk to the same DB, it shouldn't matter how many there are.
  2. Traps can be forwarded from any node where they can be received to any snmptrapd on a trapdirector node. This enables chaining them through firewalls to the nodes where they can be processed properly.
  3. When trapdirector processes a trap, it sends the result to the API of a satellite/master. Why not both, in a configurable order? So if you send the result to a satellite and you don't like the return, or it's unreachable, you resend it to the master or another satellite.
  4. In this scenario you'd have to worry about deduplication of traps if you choose to do HA by trying to send traps to all existing trapdirector instances which don't know about each other but share the DB. Maybe there's even some cheap way to discard duplicates which is better than a DB lookup over the last 5 seconds' worth of traps to see if it was already processed by fellow trapdirectors (a sketch of such a check follows below).
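
On point 4, a minimal sketch of the kind of cheap duplicate check it hints at, assuming instances that already share the DB can also share a small digest table, so the check becomes one indexed insert rather than a scan of the last few seconds of traps; the table name, key fields and 5-second window are assumptions, and sqlite3 only stands in for the real database:

```python
# Hypothetical sketch: cross-instance duplicate detection through the shared
# trapdirector DB. A digest-keyed table turns "was this trap already
# processed?" into a single indexed insert. Names and the window are
# assumptions; sqlite3 stands in for the real database.
import hashlib
import sqlite3
import time

def setup(conn):
    conn.execute("CREATE TABLE IF NOT EXISTS trap_digests ("
                 "digest TEXT PRIMARY KEY, seen_at INTEGER)")

def is_duplicate(conn, source_ip, oid, varbinds, window_seconds=5):
    digest = hashlib.sha1(
        "{}|{}|{}".format(source_ip, oid, varbinds).encode()
    ).hexdigest()
    now = int(time.time())
    # forget digests that fell out of the dedup window
    conn.execute("DELETE FROM trap_digests WHERE seen_at < ?",
                 (now - window_seconds,))
    try:
        # the PRIMARY KEY makes this insert fail when another instance
        # (or this one) already recorded the same trap inside the window
        conn.execute("INSERT INTO trap_digests (digest, seen_at) VALUES (?, ?)",
                     (digest, now))
        conn.commit()
        return False
    except sqlite3.IntegrityError:
        conn.rollback()
        return True
```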

@patrickpr
Owner

1. As long as all instances of trapdirector talk to the same DB, it shouldn't matter how many there are.

Correct, but a DB connection may be impossible from distant sites.

2. Traps can be forwarded from any node where they _can_ be received to any snmptrapd on a trapdirector node. This enables chaining them through firewalls to the nodes where they can be processed properly.

Some kind of trap routing? Not very easy to implement!

3. When trapdirector processes a trap, it sends the result to the API of a satellite/master. Why not both, in a configurable order? So if you send the result to a satellite and you don't like the return, or it's unreachable, you resend it to the master or another satellite.

Yes: satellite then master, or master only (maybe set this by zones?); a sketch of that ordering follows below.

4. In this scenario you'd have to worry about deduplication of traps if you choose to do HA by trying to send traps to all existing trapdirector instances which don't know about each other but share the DB. Maybe there's even some cheap way to discard duplicates which is better than a DB lookup over the last 5 seconds' worth of traps to see if it was already processed by fellow trapdirectors.

There is a special 'waiting' status in the DB that was implemented for this kind of thing.
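
A minimal sketch of the "satellite then master, or master only" ordering, assuming each zone simply carries an ordered list of result endpoints; zone names and URLs are placeholders, and send_result is any callable such as the send_check_result sketch earlier in the thread:

```python
# Hypothetical sketch: per-zone ordered list of result endpoints, tried in
# sequence until one accepts the check result. Zone names and URLs are
# placeholders.
SEND_ORDER = {
    "branch-office": [                          # satellite zone: satellites first
        "https://satellite1.example.com:5665",
        "https://satellite2.example.com:5665",
        "https://master1.example.com:5665",     # then fall back to the masters
        "https://master2.example.com:5665",
    ],
    "master": [                                 # master zone: masters only
        "https://master1.example.com:5665",
        "https://master2.example.com:5665",
    ],
}

def deliver_result(zone, send_result):
    """Try each endpoint configured for the zone in order; stop at the first success."""
    for endpoint in SEND_ORDER.get(zone, SEND_ORDER["master"]):
        if send_result(endpoint):
            return endpoint
    return None
```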

@p4k8
Contributor

p4k8 commented Sep 4, 2020

A DB connection may be impossible from distant sites

So that's why it might be a sound idea not to run any trapdirectors on distant sites. Like:
DB <--> trapdirector <-- snmptrapd on trapdirector host <-- firewalls/networks/whatever <-- snmptrapd with forward directive on remote site
"HA" in this part is achieved by forwarding traps from the remote host to several trapdirector destinations simultaneously, and then each of the trapdirectors would have a list of API endpoints to send the check result to.
So that would mean getting a trap at least once, and at most as many times as there are snmptrapd forward destinations. That's solved by deduplicating, I guess.

Some kind of trap routing

More like just adding forward default <address> to snmptrapd.conf, pointing at the snmptrapd on the proper trapdirector node.
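
For illustration, a remote-site snmptrapd.conf along those lines might look like the sketch below; the community string and addresses are placeholders, and the two forward lines give the "several destinations" fan-out mentioned above:

```
# /etc/snmp/snmptrapd.conf on the remote site (sketch; addresses are placeholders)
authCommunity log,execute,net public
# relay every received trap to the snmptrapd on each trapdirector node
forward default udp:198.51.100.10:162
forward default udp:198.51.100.11:162
```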

maybe set this by zones

Not sure if it actually has to be zone-aware to work properly as long as the endpoint addresses are listed in the correct order.
