Analysis Definition in Kubernetes
As the administrator of your FLAME Node, it is important to understand how analyses are deployed on your system. While the publicly available Helm chart for the FLAME Node makes it easy to see exactly how the components are defined, the analyses themselves are deployed by the Pod Orchestrator via the Python Kubernetes library, which makes it more difficult to quickly assess their configuration. This page covers the various Kubernetes resources that are deployed when an analysis is started, as well as how certain parameters can be modified to fit your security requirements.
Here is a brief overview of the resources that are deployed when initiating an analysis:
- Analysis Deployment
- Analysis Pod
- NGINX Deployment
- NGINX Pod
- Services
- ConfigMap
- NetworkPolicy
Analysis Deployment
Kubernetes Deployments
A Kubernetes Deployment is a resource which manages the creation and updating of Pods, the Kubernetes resources built around containers. Each Deployment is configured to create a ReplicaSet, which is responsible for maintaining a specified number of Pod replicas at any given time, serving as backups in case the main Pod fails for unforeseen reasons. Deployments make the workload easier to manage, and they allow one to monitor the state of a rollout and scale the workload as needed. Additionally, this template enables the use of Labels and Selectors, which are subsequently used to apply NetworkPolicies to the generated Pods, thus controlling the traffic in and out of the container.
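Because every analysis Deployment carries the component: flame-analysis label (see the template below), an administrator can quickly enumerate the analyses currently running on a node. The following is only a minimal sketch using the same Python Kubernetes library that the Pod Orchestrator relies on; the namespace and label values are taken from the templates on this page, everything else is illustrative:

# Sketch: list the analysis Deployments on this node via their labels.
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() when run inside the cluster
apps = client.AppsV1Api()

# the Pod Orchestrator labels every analysis Deployment with component=flame-analysis
analyses = apps.list_namespaced_deployment(
    namespace="default",
    label_selector="component=flame-analysis",
)
for deployment in analyses.items:
    print(deployment.metadata.name, deployment.status.ready_replicas)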
Though the code for each analysis is manually reviewed and approved prior to running on any node, FLAME operates under a minimal-to-none trust assumption. For an analysis, this means precautions are taken to lock down all traffic both entering and leaving the Analysis Pod in which it runs in the Kubernetes cluster. By default, every analysis deployed by the Pod Orchestrator has a restrictive Network Policy (see NetworkPolicy below) applied to it via its labels and selectors, isolating it fully from the surrounding network.
However, the analysis cannot run in absolute isolation: it still needs to communicate with other FLAME components in order to access data, send progress updates, and deliver intermediate and final results. When a FLAME Analysis is started, the Pod Orchestrator service therefore creates two separate Deployments: one for the analysis itself, and one for an NGINX instance. The Analysis Deployment executes the analysis script, while the NGINX Deployment acts as its forward proxy. To protect the hosting system, the Analysis Deployment is subject to stringent policies that severely limit its traffic, making the NGINX Deployment its single possible point of interaction (outside of Kubernetes' own kube-dns service, which the Analysis Pod needs for resolving cluster-internal service names). Details on how this NGINX sidecar controls traffic are discussed in the NGINX Deployment section.
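As a purely illustrative sketch (not the actual Pod Orchestrator code), submitting such a Deployment with the Python Kubernetes library essentially amounts to loading a rendered manifest, such as the template shown further below, and creating it in the cluster; the file name here is a placeholder:

# Hypothetical sketch of how a rendered Deployment manifest could be submitted;
# the real Pod Orchestrator implementation may build the objects differently.
import yaml
from kubernetes import client, config

config.load_incluster_config()  # the Pod Orchestrator itself runs inside the cluster

with open("analysis-deployment.yaml") as f:  # placeholder for the rendered template below
    manifest = yaml.safe_load(f)

client.AppsV1Api().create_namespaced_deployment(namespace="default", body=manifest)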
An additional security measure, which ensures that an Analysis Pod was created through the FLAME pipeline, is verification using the OIDC protocol and the included Keycloak instance. Keycloak serves as an identity provider (IDP) that checks whether the components or services making requests are who they claim to be, by requiring a JSON Web Token (JWT) to be bundled with each request. Our software registers the FLAME services with Keycloak upon deployment, and communication between them is authenticated using the OAuth2 endpoints.
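For illustration, obtaining such a JWT follows the standard OAuth2 client-credentials flow against Keycloak's token endpoint. In this sketch the Keycloak service name, realm, and client credentials are placeholders, not the values used by the FLAME services:

# Hypothetical sketch of the OAuth2 client-credentials flow against Keycloak.
import requests

token_endpoint = "http://<keycloak service>/realms/<realm>/protocol/openid-connect/token"

response = requests.post(
    token_endpoint,
    data={
        "grant_type": "client_credentials",
        "client_id": "<registered FLAME service>",
        "client_secret": "<client secret>",
    },
)
response.raise_for_status()
jwt = response.json()["access_token"]

# the token is then bundled with requests made to other FLAME components
headers = {"Authorization": f"Bearer {jwt}"}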
Template
Each Analysis Deployment is defined following this template:
kind: Deployment
apiVersion: apps/v1
metadata:
  name: <analysis deployment name>
  namespace: default
  labels:
    app: <analysis deployment name>
    component: flame-analysis
spec:
  replicas: 1
  selector:
    matchLabels:
      app: <analysis deployment name>
      component: flame-analysis
  template:
    metadata:
      labels:
        app: <analysis deployment name>
        component: flame-analysis
    spec:
      containers:
        - name: <analysis deployment name>
          image: <URL to image in Harbor>
          ports:
            - containerPort: 8000
              protocol: TCP
          env:
            - name: DATA_SOURCE_TOKEN
              value: none_needed
            - name: KEYCLOAK_TOKEN
              value: <randomly generated token>
            - name: ANALYSIS_ID
              value: <analysis UUID>
            - name: PROJECT_ID
              value: <project UUID>
            - name: DEPLOYMENT_NAME
              value: <analysis deployment name>
          resources: {}
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: IfNotPresent
      restartPolicy: Always
      terminationGracePeriodSeconds: 30
      dnsPolicy: ClusterFirst
      securityContext: {}
      imagePullSecrets:
        - name: flame-harbor-credentials
      schedulerName: default-scheduler
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxUnavailable: 25%
      maxSurge: 25%
  revisionHistoryLimit: 10
  progressDeadlineSeconds: 600
status:
  observedGeneration: 1
  replicas: 1
  updatedReplicas: 1
  readyReplicas: 1
  availableReplicas: 1
  conditions:
    - type: Available
      status: 'True'
      reason: MinimumReplicasAvailable
      message: Deployment has minimum availability.
    - type: Progressing
      status: 'True'
      reason: NewReplicaSetAvailable
      message: >-
        ReplicaSet "<analysis deployment name>"
        has successfully progressed.
NGINX Deployment
NGINX serves as a sidecar proxy and acts as the only means of communication between the Analysis Pod and everything else. The paired analysis container can communicate only through the endpoints defined within this NGINX Deployment (endpoint descriptions can be found in the ConfigMap section).
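As a rough illustration from the analysis side, every outbound request has to be addressed to the sidecar's Service rather than to the target component directly. The paths below correspond to locations defined in the ConfigMap section; the HTTP methods and payloads are assumptions, not the actual FLAME analysis API:

# Illustrative only: inside the analysis container, all traffic goes through the sidecar.
import os
import requests

# DEPLOYMENT_NAME is injected into the analysis container (see the Deployment template)
sidecar = f"http://nginx-{os.environ['DEPLOYMENT_NAME']}"

# e.g. check that the Message Broker is reachable through the proxy
requests.get(f"{sidecar}/message-broker/healthz")

# e.g. hand an intermediate result to the Result Service (method and payload are placeholders)
requests.post(f"{sidecar}/storage/intermediate/", data=b"<serialized result>")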
Template
Each NGINX Deployment can be defined with the following template:
kind: Deployment
apiVersion: apps/v1
metadata:
  name: nginx-<analysis deployment name>
  namespace: default
  generation: 1
  labels:
    app: nginx-<analysis deployment name>
    component: flame-analysis-nginx
spec:
  replicas: 1
  selector:
    matchLabels:
      app: nginx-<analysis deployment name>
  template:
    metadata:
      labels:
        app: nginx-<analysis deployment name>
        component: flame-analysis-nginx
    spec:
      volumes:
        - name: nginx-vol
          configMap:
            name: nginx-<analysis deployment name>-config
            items:
              - key: nginx.conf
                path: nginx.conf
            defaultMode: 420
      containers:
        - name: nginx-<analysis deployment name>
          image: nginx:latest
          ports:
            - containerPort: 80
              protocol: TCP
          resources: {}
          volumeMounts:
            - name: nginx-vol
              mountPath: /etc/nginx/nginx.conf
              subPath: nginx.conf
          livenessProbe:
            httpGet:
              path: /healthz
              port: 80
              scheme: HTTP
            initialDelaySeconds: 15
            timeoutSeconds: 5
            periodSeconds: 20
            successThreshold: 1
            failureThreshold: 1
          terminationMessagePath: /dev/termination-log
          terminationMessagePolicy: File
          imagePullPolicy: Always
      restartPolicy: Always
      terminationGracePeriodSeconds: 30
      dnsPolicy: ClusterFirst
      securityContext: {}
      schedulerName: default-scheduler
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxUnavailable: 25%
      maxSurge: 25%
  revisionHistoryLimit: 10
  progressDeadlineSeconds: 600
status:
  observedGeneration: 1
  replicas: 1
  updatedReplicas: 1
  readyReplicas: 1
  availableReplicas: 1
  conditions:
    - type: Available
      status: 'True'
      reason: MinimumReplicasAvailable
      message: Deployment has minimum availability.
    - type: Progressing
      status: 'True'
      reason: NewReplicaSetAvailable
      message: >-
        ReplicaSet
        "nginx-<analysis deployment name>-<unique ID>" has
        successfully progressed.
Services
Kubernetes Service
A Kubernetes Service is what exposes an application running in a Pod so that it can be accessed. Kubernetes creates an endpoint for the specified Pod and port, and because Pods are ephemeral, a separate Service resource is required to direct traffic to the correct Pod.
For the FLAME Node software, Services are created for both the analysis and NGINX Pods, targeting container ports 8000 and 80, respectively. These Services use the same labels and selectors as the Deployments and are thus subject to the same restrictive Network Policy.
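As a minimal sketch (assuming the objects are built directly with the Python Kubernetes library; the actual Pod Orchestrator implementation may differ), the Analysis Service maps Service port 80 to the analysis container's port 8000:

# Sketch: the Analysis Service forwards port 80 to the analysis container port 8000.
from kubernetes import client, config

config.load_kube_config()
name = "<analysis deployment name>"  # placeholder

service = client.V1Service(
    metadata=client.V1ObjectMeta(
        name=name,
        namespace="default",
        labels={"app": name, "component": "flame-analysis"},
    ),
    spec=client.V1ServiceSpec(
        type="ClusterIP",
        selector={"app": name},
        ports=[client.V1ServicePort(protocol="TCP", port=80, target_port=8000)],
    ),
)
client.CoreV1Api().create_namespaced_service(namespace="default", body=service)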
Below are the configurations for both the Analysis and NGINX Services.
Analysis Service Template
kind: Service
apiVersion: v1
metadata:
  name: <analysis deployment name>
  namespace: default
  labels:
    app: <analysis deployment name>
    component: flame-analysis
spec:
  ports:
    - protocol: TCP
      port: 80
      targetPort: 8000
  selector:
    app: <analysis deployment name>
  type: ClusterIP
NGINX Service Template
kind: Service
apiVersion: v1
metadata:
  name: nginx-<analysis deployment name>
  namespace: default
  labels:
    app: nginx-<analysis deployment name>
    component: flame-analysis-nginx
spec:
  ports:
    - protocol: TCP
      port: 80
      targetPort: 80
  selector:
    app: nginx-<analysis deployment name>
  type: ClusterIP
ConfigMap
A Kubernetes ConfigMap is used in the FLAME Node to define the endpoints for the NGINX sidecar Pod by providing the imported NGINX configuration. Because the traffic from the Analysis Pod is tightly controlled, it is necessary to pre-define the NGINX endpoints which this Pod can use for communication and for transmitting results. The endpoints enable communication with the other FLAME Node components and are configured such that each one only accepts connections from the analysis container (identified by its Service IP), with one exception noted below.
Incoming Messages
Note: One endpoint, /analysis, is configured to allow the FLAME Message Broker and Pod Orchestrator to send/retrieve information to/from the Analysis Pod and serves as the only point of ingress to the analysis.
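To make this ingress path concrete: a request from the Message Broker or Pod Orchestrator first hits the NGINX sidecar under the /analysis prefix, which is stripped and proxied to the Analysis Service, while requests from any other source IP are denied. The sub-path and payload in this sketch are placeholders:

# Hypothetical illustration of the only ingress route to the analysis container.
import requests

sidecar = "http://nginx-<analysis deployment name>"

# NGINX rewrites /analysis/... and forwards it to the Analysis Service (port 8000);
# only the Message Broker and Pod Orchestrator Service IPs are allowed.
requests.post(f"{sidecar}/analysis/<sub-path>", json={"message": "<payload>"})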
The ConfigMap resource name is formatted as nginx-<analysis deployment name>-config and defined as such:
kind: ConfigMap
apiVersion: v1
metadata:
  name: nginx-<analysis deployment name>-config
  namespace: default
  labels:
    component: flame-nginx-analysis-config-map
data:
  nginx.conf: |2-
    worker_processes 1;
    events { worker_connections 1024; }
    http {
      sendfile on;
      server {
        listen 80;
        client_max_body_size 0;
        chunked_transfer_encoding on;
        proxy_redirect off;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
        proxy_set_header X-Forwarded-Proto $scheme;
        # health check
        location /healthz {
          return 200 'healthy';
        }
        # egress: analysis deployment to kong
        location /kong {
          rewrite ^/kong(/.*) $1 break;
          proxy_pass http://flame-node-kong-proxy;
          allow <Analysis Service endpoint IP>;
          deny all;
        }
        # egress: analysis deployment to result-service
        location ~ ^/storage/(final|local|intermediate)/ {
          rewrite ^/storage(/.*) $1 break;
          proxy_pass http://flame-node-node-result-service:8080;
          allow <Analysis Service endpoint IP>;
          deny all;
        }
        # egress: analysis deployment to hub-adapter
        location /hub-adapter/kong/datastore/<project UUID> {
          rewrite ^/hub-adapter(/.*) $1 break;
          proxy_pass http://flame-node-hub-adapter-service:5000;
          allow <Analysis Service endpoint IP>;
          deny all;
        }
        # egress: analysis deployment to message broker: participants
        location ~ ^/message-broker/analyses/<analysis UUID>/participants(|/self) {
          rewrite ^/message-broker(/.*) $1 break;
          proxy_pass http://flame-node-node-message-broker;
          allow <Analysis Service endpoint IP>;
          deny all;
        }
        # egress: analysis deployment to message broker: analysis message
        location ~ ^/message-broker/analyses/<analysis UUID>/messages(|/subscriptions) {
          rewrite ^/message-broker(/.*) $1 break;
          proxy_pass http://flame-node-node-message-broker;
          allow <Analysis Service endpoint IP>;
          deny all;
        }
        # egress: analysis deployment to message broker: healthz
        location /message-broker/healthz {
          rewrite ^/message-broker(/.*) $1 break;
          proxy_pass http://flame-node-node-message-broker;
          allow <Analysis Service endpoint IP>;
          deny all;
        }
        # egress: analysis deployment to po: stream logs
        location /po/stream_logs {
          #rewrite ^/po(/.*) $1 break;
          proxy_pass http://flame-node-po-service:8000;
          allow <Analysis Service endpoint IP>;
          deny all;
          proxy_connect_timeout 10s;
          proxy_send_timeout 120s;
          proxy_read_timeout 120s;
          send_timeout 120s;
        }
        # ingress: message-broker/pod-orchestration to analysis deployment
        location /analysis {
          rewrite ^/analysis(/.*) $1 break;
          proxy_pass http://<Analysis Service Name>;
          allow <FLAME Message Broker Service endpoint IP>;
          allow <FLAME Pod Orchestrator Service endpoint IP>;
          deny all;
        }
      }
    }
Network Policy
Kubernetes Network Policy
A Kubernetes Network Policy is a resource which restricts traffic either exiting (egress) or entering (ingress) a Pod and is applied to any Pod via label selectors.
In the case of the FLAME Node, the label onto which Network Policies are applied is app: <analysis deployment name>, meaning that any Pod in the Kubernetes cluster with a matching label, e.g. the Analysis Pod, will have this network policy applied to it. The ingress rule only allows traffic coming from the associated NGINX Pod, while the egress rule only allows requests to be sent to either the NGINX Pod or the Kubernetes cluster's DNS Pod (named 'kube-dns'). The DNS permission is necessary to enable name resolution within the Kubernetes cluster, thus allowing the Analysis Pod to communicate with the NGINX sidecar. No other traffic or communication, including to other Pods or the internet, is possible for the Analysis Pod while this policy is in place.
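Administrators who want to confirm that an analysis is actually covered by this policy can read it back with the Python Kubernetes library; this is only a sketch, with the policy name format taken from the template below:

# Sketch: verify that the restrictive policy exists and targets the Analysis Pod.
from kubernetes import client, config

config.load_kube_config()
networking = client.NetworkingV1Api()

policy = networking.read_namespaced_network_policy(
    name="nginx-to-<analysis deployment name>-policy",
    namespace="default",
)
print(policy.spec.pod_selector.match_labels)  # {'app': '<analysis deployment name>'}
print(policy.spec.policy_types)               # ['Ingress', 'Egress']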
Here is the template describing this policy:
kind: NetworkPolicy
apiVersion: networking.k8s.io/v1
metadata:
  name: nginx-to-<analysis deployment name>-policy
  namespace: default
  labels:
    component: flame-nginx-to-analysis-policy
spec:
  podSelector:
    matchLabels:
      app: <analysis deployment name>
  ingress:
    - from:
        - podSelector:
            matchLabels:
              app: nginx-<analysis deployment name>
  egress:
    - to:
        - podSelector:
            matchLabels:
              app: nginx-<analysis deployment name>
        - podSelector:
            matchLabels:
              k8s-app: kube-dns
          namespaceSelector:
            matchLabels:
              kubernetes.io/metadata.name: kube-system
  policyTypes:
    - Ingress
    - Egress