<PR ID>: (activity this week / total activity) summary
There was 1 Prioritized Active pull request:
-
838: (5/142) local-storage: KNIP-1770: Add proposal for exporting local storage device health metrics (rohantmp)
priority/important-soon
This enhancement proposes that we export health metrics for each local device on each node.
<PR ID>: (activity this week / total activity) summary
There were 11 Other Merged pull requests:
-
571: (3/277) network: Cloud API component for egress IP (alexanderConstantinescu)
OVN-Kubernetes and openshift-sdn today support the feature egress IP on bare-metal platforms. Going forward both network plugins will need to support the feature on conventional cloud provider platforms (such as: GCP, AWS, Azure). Cloud providers require additional management of the egress IP address via the cloud provider's cloud API. That is required since the cloud provider will not let the network plugin claim an additional IP on the node, which it does not recognize.
To be able to do this we will need to add logic to a dedicated component as to be able to handle this. This enhancement proposal goes into detail concerning:
- The cloud provider data model required as to manage egress IP assignments in the cloud
- The model and interaction between this future entity, the network plugin and any additional component required for this interaction to happen.
OpenShift 4.10 aims to finalize this component for cloud egress IP assignments, functional with both openshift-sdn and OVN-Kubernetes, supporting: Azure, GCP and AWS. This document tries however to make an abstraction from the requirements and deadlines set for each cloud w.r.t. each OpenShift release. Hence it does not solely focus on AWS or Azure, but instead aims at defining an architecture which is cloud agnostic and which would be applied with minimal implementation efforts for additional clouds.
-
673: (9/88) machine-api: short-circuiting-backoff (mshitrit)
By using
MachineHealthChecks
a cluster admin can configure automatic remediation of unhealthy machines and nodes. The machine healthcheck controller's remediation strategy is deleting the machine, and letting the cloud provider create a new one. This isn't the best remediation strategy in all environments.Any Machine that enters the
Failed
state is remediated immediately, without waiting, by the MHC. When this occurs, if the error which caused the failure is persistent (spot price too low, configuration error), replacement Machines will also beFailed
. As replacement machines start and fail, MHC causes a hot loop of Machine being deleted and recreated. This hot looping makes it difficult for users to find out why their Machines are failing. Another side effect of machines constantly failing, is the risk of hitting the benchmark of machine failures percentage - thus triggering the "short-circuit" mechanism which will prevent all remediations.With this enhancement we propose a better mechanism. In case a machine enters the
Failed
state and does not have a NodeRef or a ProviderID, it will not be remediated until after a certain time period has passed - thus allowing a manual intervention in order to break to hot loop. -
687: (3/125) storage: Add AWS EFS CSI driver operator (jsafrane)
AWS EFS CSI driver (+ corresponding operator) is an optional OCP component. It should be installed through OLM, when users opts-in.
This document describes existing (unsupported) solution and how to turn it into a supported one, both from code (and build and shipment), and from user perspective.
-
817: (77/308) network: Baremetal IPI Network Configuration for Day-1 (cybertron)
Currently in the IPI flow, there is no way to provide day-1 network configuration which is a common requirement, particularly for baremetal users. We can build on the UPI static networking enhancements to enable such configuration in the IPI flow.
-
850: (8/94) network: Enable flows collection (mariomac)
This document describes configuration changes and requirements at the CNO/OVN level to enable network observability as depicted in the high level Enhancement proposal for Network Observability.
It can be seen as a follow-up of a previous enhancement bringing Network Flows export support to OpenShift when used with OVNKubernetes. In this follow-up, advanced configuration for OvS flows export should be made possible, and a mechanism must be set up to allow interactions between the Cluster Network Operator (CNO) and the Network Observability Operator (NOO - development in progress).
-
866: (2/264) general: Proposed OLM-based Monitoring Stack Solution for RH Managed Services and future needs. (bwplotka)
This document proposes a solution and a delivery plan for scraping, querying and alerting on metrics related to Red Hat Managed Services running in OSD clusters. The proposed solution is not intended to complement the existing monitoring stack managed by the Cluster Monitoring Operator, nor used directly by Customers at this moment.
-
909: (12/96) baremetal: Include CoreOS ISO in the release payload (zaneb)
The baremetal platform is switching from the OpenStack QCOW2 CoreOS image to the live ISO (as also used for UPI). To ensure that existing disconnected clusters can update to this, the ISO image will be included in the release payload. This will be balanced out by removing the RHEL image currently shipped in the release payload.
-
938: (4/4) monitoring: Add monitoring owners (simonpasquier)
@bparees @dhellmann I think that it makes sense for the monitoring team leads and architects to be approvers of the monitoring enhancements. What do you think?
- 917: (3/4) dev-guide: host-port-registry: reserve a port for virtual media TLS (dtantsur)
- 933: (35/95) guidelines: enhancement_template: add API extension sections (sttts)
- 934: (3/3) insights: Insights Operator pulling and exposing SCA entitlement certs - update (tremes)
<PR ID>: (activity this week / total activity) summary
There were 5 Other New pull requests:
-
937: (71/71) monitoring: support configuration of alerting notifications by application owners (simonpasquier)
approved
This document describes a solution that allows OpenShift users to route alert notifications without cluster admin intervention. It complements the existing [user-workload monitoring stack][uwm-docs], enabling a full self-service experience for workload monitoring.
-
939: (94/94) machine-config: [OCPNODE-725] Control Group v2 Enablement on New Clusters (rphillips)
Control Group v2 (cgroup v2) enablement in Kubernetes has progressed to beta upstream. The underlying runtime (cri-o) and supporting subsystems are now ready for customers to begin their own testing with it. Not all workloads will be compatible with cgroup v2, so it will not be enabled by default within OpenShift at this time.
Note: This enhancement is focusing on
pure
mode cgroup v2. Mixed mode environments may behave differently (metrics, vpa, hpa, etc) since cgroup v1 is not compatible with cgroup v2. -
940: (19/19) cluster-logging: [LOG-1815] Enhancement proposal: Add alerts and rules for operator-managed LokiStack (ronensc)
do-not-merge/work-in-progress
This enhancement adds observability capabilities to Loki operator in the form of Prometheus rules and alerts. These rules and alerts are used to identify abnormal conditions and update operators.
-
942: (1/1) cluster-logging: [cluster-logging] Update OWNERS (periklis)
/cc @alanconway @jcantrill
- 935: (8/8) dev-guide: Bug 1951835: reconciliation.md: Update for degraded status propagation during init (jottofar)
<PR ID>: (activity this week / total activity) summary
There were 25 Other Active pull requests:
- 922: (235/286) installer: Allow installer to include/exclude components based on user select install solution (bparees)
- 900: (76/84) network: Multus service abstraction enhancement (s1061123)
- 918: (74/75) installer: vsphere: add multiple datacenter and clusters (jcpowermac)
- 924: (65/131) oc: CLI Manager (deejross)
- 925: (62/170) node-latency-profile: Kubelet<->Controller Manager communication Profiles (harche)
- 920: (48/109) etcd: ETCD-236: etcd: scaling etcd with raft learners (hexfusion)
- 732: (46/208) general: Add Smart NIC OVN offload enhancement (zshi-redhat)
- 929: (28/58) api-review: [OCPNODE-521] New CRD ImageContentPolicy(ICP) to support AllowMirrByTags (QiWang19)
- 913: (26/107) machine-api: Add cluster-api integration enhancement (alexander-demichev)
- 876: (22/71) ingress: Add mutable-publishing-scope enhancement (Miciah)
- 855: (17/109) ingress: ingress: Add bind-options enhancement (m-yosefpor)
- 725: (9/43) distributed-tracing: Allow/Expose options to enable distributed tracing in components as features are added upstream (sallyom)
- 921: (8/101) general: Enhancement proposal for Network Observability (stleerh)
- 931: (6/37) dns: NE-367: Add logLevel and operatorLogLevel APIs for DNS (miheer)
- 881: (6/50) cluster-logging: LOG-1514: LokiStack CR extension for the gateway configuration (sasagarw)
- 912: (4/22) installer: Enhancement proposal for OpenShift IPI on Nutanix AOS with AHV (vnephologist)
- 837: (4/115) insights: Conditional Data Gathering for Insights Operator (Sergey1011010)
- 926: (4/104) network: Proposal to allow mtu changes (jcaamano)
- 722: (3/16) multi-arch: Add "Build OKD for ppc64le" proposal (mjturek)
- 862: (3/117) machine-api: Add Machine Deletion Hooks Proposal (JoelSpeed)
- 907: (2/15) console: CONSOLE-2894: Multi-cluster console (spadgett)
- 201: (2/103) general: bootimages: Downloading and updating bootimages via release image (cgwalters)
- 773: (1/44) installer: Enhancement proposal for OpenShift IPI on IBM Cloud (jeffnowicki)
- 791: (1/72) baremetal: Support RAID and BIOS configuration for baremetal IPI deployments (hs0210)
- 919: (1/12) cluster-logging: Multi-Container-Structured-Logging (alanconway)
<PR ID>: (activity this week / total activity) summary
There were 4 Other Closed pull requests:
- 652: (6/21) node: Enable cgroup v2 support (harche)
- 654: (3/26) dns: ARO private DNS zone resource removal (jim-minter)
- 895: (3/14) builds: WIP - Build CSI Volumes (adambkaplan)
- 941: (5/5) dev-guide: Add upstream CSI ports to proxy to (bertinatto)
<PR ID>: (activity this week / total activity) summary
There was 1 Revived (closed more than 14 days ago, but with new comments) pull request:
- 660: (0/25) cluster-logging: Flow control API enhancements, first draft. (alanconway)
<PR ID>: (activity this week / total activity) summary
There were 14 Idle (no comments for at least 14 days) pull requests:
- 371: (0/31) ingress: Add forwarded-header-policy enhancement (Miciah)
- 736: (0/175) installer: Add enhancement - IBM Cloud provider for Power Virtual Server platform (jaypoulz)
- 738: (0/147) general: Installing OCP on ARM-Based Smart NICs (danwinship)
- 745: (0/120) security: Security Profiles Operator integration in OpenShift (JAORMX)
- 811: (0/63) network: Enhancement proposal for OVN secondary networks (maiqueb)
- 812: (0/48) node: Add proposal about Node Operator (saschagrunert)
- 836: (0/21) etcd: ETCD-58: Add enhancement for automated defragmentation (hexfusion)
- 867: (0/164) node-tuning: Move PAO to OCP enhancement (MarSik)
- 890: (0/61) network: Introduce DPU OVNKube Operator (pliurh)
- 894: (0/62) authentication: authentication/login-logout: initial enhancement (s-urbaniak)
- 902: (0/8) kubelet: Add initial OpenShift swap enhancement (ehashman)
- 618: (0/23) dev-guide: Add more details about host port ownership (danwinship)
- 923: (0/7) windows-containers: [docs] remove invalid web links (selansen)
- 932: (0/1) dev-guide: Host port registry: Add loopback metrics port for SDN (martinkennelly)
<PR ID>: (activity this week / total activity) summary
There were 7 Other lifecycle/stale or lifecycle/rotten pull requests:
- 292: (2/206) machine-api: Add Managing Control Plane machines proposal (enxebre)
- 341: (2/83) maintenance: Machine-maintenance operator proposal (dofinn)
- 343: (2/48) authentication: cluster-wide oauth-proxy settings (deads2k)
- 363: (2/204) cvo: Enhancement for adding upgrade preflight checks for operators (LalatenduMohanty)
- 427: (2/57) update: enhancements/update/phased-rollouts: Propose a new enhancement (wking)
- 905: (1/23) hypershift: Introduce konnectivity-socks5-proxy enhancement (awgreene)
- 893: (2/8) ingress: [fix] Ingress custom http error pages (lgchiaretto)