Commit cdb5d86

Author: MayaCrmi
Merge branch 'main' of github.com:redhat-community-ai-tools/UnifAI into GENIE-1148/story/create-ui-testing
2 parents: ff7ad81 + d432cbe

File tree

213 files changed: +14163 −397 lines changed


.coderabbit.yaml

Lines changed: 45 additions & 0 deletions
```yaml
reviews:
  auto_review:
    enabled: true
    auto_incremental_review: true

  commit_status: true
  high_level_summary: true

  path_filters:
    - "multi-agents/**"
    - "backend/**"
    - "ui/**"

  instructions: |
    This repository represents an agentic AI system with a modular, layered architecture.

    Each of the main directories (multi-agents/, backend/, ui/) contains README.md and ARCHITECTURE.md files
    that define architectural intent, responsibilities, and boundaries.

    These documents should be treated as the source of truth.

    During reviews, always cross-check code changes against the architectural guidance in those files.

    FOCUS AREAS:
    - architecture
    - maintainability
    - modularity
    - separation of concerns
    - dependency boundaries
    - layering
    - testability

    Please pay special attention to:

    - Whether new components are placed in the correct layer and folder
    - Whether responsibilities align with the architecture defined in the directory-level README.md and ARCHITECTURE.md files
    - Whether any architectural boundaries are violated
    - Whether new dependencies increase coupling in undesirable ways
    - Whether changes weaken modularity or future extensibility
    - Whether agent, backend, and UI concerns are leaking across layers

    Prefer architectural and system-level feedback over stylistic feedback.
    When relevant, explicitly reference architectural mismatches (for example: "this component seems to belong under X based on ARCHITECTURE.md").

    Act as a senior software engineer deeply familiar with this project's architecture.
```
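Assuming the `path_filters` entries above follow the usual glob semantics, a quick way to sanity-check which changed files would fall under review is a sketch like this (the helper and file names are hypothetical, and CodeRabbit's real matcher may differ in edge cases):

```python
from fnmatch import fnmatch

# The path_filters from .coderabbit.yaml above (illustrative only).
PATH_FILTERS = ["multi-agents/**", "backend/**", "ui/**"]

def is_reviewed(path: str) -> bool:
    """Return True if a changed file path matches any review filter."""
    # fnmatch treats "**" like "*" (it does not special-case "/"),
    # which is close enough for these prefix-style patterns.
    return any(fnmatch(path, pattern) for pattern in PATH_FILTERS)

changed = ["backend/api/server.py", "ui/src/App.tsx", "docs/notes.md"]
reviewed = [p for p in changed if is_reviewed(p)]
print(reviewed)  # backend and ui files match; docs/notes.md does not
```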

.github/CODEOWNERS

Lines changed: 3 additions & 2 deletions
```diff
 * @odaiodeh @nirsisr

-ci/ @sfiresht
-helm/ @sfiresht
+ci/ @sfiresht @yhabushi
+helm/ @sfiresht @yhabushi
+.github/ @sfiresht @yhabushi
```
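GitHub resolves CODEOWNERS by taking the last matching pattern in the file. A minimal sketch of that rule for the entries above (patterns simplified to prefixes here; GitHub's real matcher supports full gitignore-style syntax):

```python
# Simplified CODEOWNERS resolution: the LAST matching pattern wins.
RULES = [
    ("*", ["@odaiodeh", "@nirsisr"]),
    ("ci/", ["@sfiresht", "@yhabushi"]),
    ("helm/", ["@sfiresht", "@yhabushi"]),
    (".github/", ["@sfiresht", "@yhabushi"]),
]

def owners_for(path: str) -> list[str]:
    """Return the owners of the last rule matching the given path."""
    owners: list[str] = []
    for pattern, rule_owners in RULES:
        if pattern == "*" or path.startswith(pattern):
            owners = rule_owners  # later matches override earlier ones
    return owners

print(owners_for(".github/workflows/backup-dbs.yaml"))  # ['@sfiresht', '@yhabushi']
print(owners_for("backend/app.py"))                      # ['@odaiodeh', '@nirsisr']
```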

.github/CODERABBIT.md

Lines changed: 68 additions & 0 deletions
# 🤖 CodeRabbit – Automated Code Review Agent

This repository integrates **CodeRabbit** as an automated AI-powered code review (CR) agent.

CodeRabbit analyzes every pull request and provides structured, context-aware feedback directly on the PR, helping improve overall code quality, maintainability, and consistency across the project.

---

## 🎯 Purpose

The goal of using CodeRabbit in this project is to serve as a **first-line automated code reviewer** that assists both contributors and maintainers by:

- Summarizing the intent and scope of each pull request
- Highlighting potential bugs, edge cases, and logical issues
- Suggesting improvements related to code quality, readability, and best practices
- Identifying maintainability, performance, and security concerns early

CodeRabbit complements human reviewers and our internal CR agents. It is not intended to replace manual reviews, but to strengthen and accelerate them.

---

## ⚙️ How It Works

CodeRabbit is automatically triggered via **GitHub Actions** on every:

- Newly opened pull request
- Updated pull request (new commits)
- Reopened pull request

Once triggered, CodeRabbit:

1. Analyzes the code changes in the context of the repository
2. Posts a high-level summary explaining what the PR introduces
3. Adds inline review comments on relevant lines in the diff
4. Provides actionable suggestions to improve code quality and structure

All feedback appears directly inside the pull request conversation and diff view.

---

## 🧠 Role in the Application

Within this project’s agentic AI system, CodeRabbit operates as a **general-purpose CR agent**, focusing on:

- Broad software engineering best practices
- Cross-file change awareness
- Early issue detection
- Developer-oriented feedback

It works alongside our internal AI CR assistant, which focuses on project-specific logic, architectural reasoning, and workflow validation. Together, they form a multi-agent automated code review pipeline.

---

## 🧩 Benefits

- Faster review cycles
- Higher baseline code quality
- Reduced reviewer fatigue
- More consistent feedback across pull requests
- Early detection of issues before human review

---

## 🔁 Continuous Improvement

CodeRabbit continuously adapts through interaction in pull requests and repository-level configuration, allowing its reviews to better align over time with the project’s conventions and expectations.

---

.github/README.md

Lines changed: 111 additions & 0 deletions
# GitHub Actions & CI/CD

This folder contains all CI/automation scripts and workflows for GitHub Actions.

## Overview

Workflows are configured to run automated tasks using GitHub Actions. For complex operations, we use dedicated scripts (in `.github/scripts/`) that are invoked from the workflow files.

## Available Workflows

- **backup-dbs.yaml** - Automated database backups for MongoDB and Qdrant (backups are uploaded to the internal GitLab and currently have no retention policy)
- **verify-agent-deps.yaml** - Dependency verification for agents

## Prerequisites

1. GitHub must be able to access the target cluster **OR** you must have a self-hosted runner that can access both GitHub and the cluster (see [Creating a Runner](#creating-a-runner) below)
2. GitHub Environments must be configured with the appropriate variables and secrets for each cluster (e.g., `PRE-PRODUCTION`, `PRODUCTION`)

### Important Notes

- Since every deployment is a bit different, the existing workflows won't necessarily work out of the box for a deployment that differs from the one currently in use. Users who want to deploy UnifAI in their own clusters should understand their infrastructure and networking well enough to either adapt an existing workflow to their needs or create a new one that fits.
- When using runners, the `runs-on` field refers to **labels**, not runner names. Ensure matching labels exist before running workflows.
- Environment-specific variables (like `QDRANT_URL`, `MONGO_URI`, `API_URL`) must be configured in GitHub repository settings under **Environments**.
## GitHub Environments Setup

The workflows use GitHub Environments to manage cluster-specific configurations:

1. Go to **Settings** → **Environments** in your repository
2. Create environments matching your cluster names (e.g., `PRE-PRODUCTION`, `PRODUCTION`)
3. Add environment-specific variables:
   - `API_URL` - Kubernetes API server URL
   - `MONGO_URI` - MongoDB connection string
   - `QDRANT_URL` - Qdrant cluster URL
4. Add environment-specific secrets:
   - `ACCESS_TOKEN` - Kubernetes access token
   - Other sensitive credentials as needed
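The setup steps above can be consumed from a workflow roughly like this hypothetical job fragment (the `environment`, `vars`, and `secrets` contexts are standard GitHub Actions syntax; the job name, labels, and steps are illustrative, not the repo's actual workflow):

```yaml
# Hypothetical sketch: select a GitHub Environment by input and read its
# variables/secrets. Only API_URL, MONGO_URI, QDRANT_URL and ACCESS_TOKEN
# are taken from this README; everything else is illustrative.
jobs:
  backup:
    runs-on: [self-hosted]
    environment: ${{ inputs.target_cluster }}
    env:
      API_URL: ${{ vars.API_URL }}
      MONGO_URI: ${{ vars.MONGO_URI }}
      QDRANT_URL: ${{ vars.QDRANT_URL }}
      ACCESS_TOKEN: ${{ secrets.ACCESS_TOKEN }}
    steps:
      - uses: actions/checkout@v4
      - run: python .github/scripts/backup_mongo.py
```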
## Database Backup Details

### MongoDB Backup

MongoDB backups are performed using `mongodump`, which is straightforward:

```bash
mongodump --uri="mongodb://localhost:27017" --out="/tmp/backup"
```

**Parameters:**
- `--uri` - Connection string to the MongoDB instance to back up
- `--out` - Target directory for the backup (creates a new folder)
- `--db` (optional) - Specific database name (default: all databases)
### Qdrant Backup

Qdrant backups require creating snapshots via the API or UI. The workflow uses a Python script (`.github/scripts/qdrant_backup.py`) to:

1. Connect to the Qdrant cluster
2. Create snapshots for all collections
3. Download the snapshots locally
4. Upload them to the backup repository

For more details, see the [Qdrant documentation](https://qdrant.tech/documentation/database-tutorials/create-snapshot/).
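The snapshot steps above map onto Qdrant's REST endpoints. A sketch of just the URL construction (no network calls; the actual script presumably uses `qdrant-client`, and the base URL and collection name here are assumptions):

```python
# Qdrant snapshot endpoints: POST creates a snapshot, GET downloads one.
def snapshot_create_url(base: str, collection: str) -> str:
    """POST here to create a snapshot of one collection."""
    return f"{base}/collections/{collection}/snapshots"

def snapshot_download_url(base: str, collection: str, snapshot: str) -> str:
    """GET here to download a finished snapshot file."""
    return f"{base}/collections/{collection}/snapshots/{snapshot}"

base = "http://qdrant.example:6333"  # assumed QDRANT_URL
print(snapshot_create_url(base, "documents"))
```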
## Running Workflows Manually

### Prerequisites

1. The workflow must have the `workflow_dispatch` trigger enabled
2. GitHub CLI must be installed and authenticated
3. The workflow file must exist in the `main` branch (workflows in feature branches cannot be manually triggered)

### Example Command

```bash
gh workflow run backup-dbs.yaml \
  -f target_cluster=PRE-PRODUCTION \
  -f target_branch=GENIE-1071/backup_dbs \
  -f target_namespace=tag-ai--pipeline
```

**Parameters:**
- `-f target_cluster` - The cluster environment to back up (must match a configured GitHub Environment)
- `-f target_branch` - The branch to check out for the workflow
- `-f target_namespace` - The Kubernetes namespace to back up
## Appendix

### Creating a Runner

To create a new self-hosted runner:

1. Go to your repository's **Settings** tab
2. In the left sidebar, select **Actions** → **Runners**
3. Click **New self-hosted runner**
4. Follow the setup instructions (the authentication tokens are unique to your repository)

For more details, see the [GitHub documentation on self-hosted runners](https://docs.github.com/en/actions/hosting-your-own-runners/managing-self-hosted-runners/adding-self-hosted-runners).

### Connecting to GitLab

Since the GitHub runners can't reach GitLab, we had to use a VM running on CNV.
To make GitLab accessible to this runner, we need to set a deploy key on the target repo (go to the repository > deploy keys and set the VM's public key as the deploy key). This allows the VM to perform actions on the target repo without needing to specify credentials.

### UnifAI team infrastructure

In the case of the UnifAI team, the lab structure is a bit "special": the code resides in a public GitHub repo, whereas all the deployment resources reside inside the company intranet. To overcome this, we have a self-hosted runner with access to both domains, so the code is downloaded from GitHub (for example, in order to run a workflow) and then all actions are run against the internal resources.

.github/requirements.txt

Lines changed: 4 additions & 0 deletions
```
qdrant-client==1.16.1
requests>=2.32.5
kubernetes==35.0.0
GitPython==3.1.46
```

.github/scripts/backup_mongo.py

Lines changed: 136 additions & 0 deletions
```python
import os
import subprocess
from datetime import datetime
from kubernetes import client, config
from kubernetes.stream import stream


# Environment variables
MONGO_POD = os.getenv("MONGO_POD")
NAMESPACE = os.getenv("NAMESPACE")
CLUSTER = os.getenv("CLUSTER")
ACCESS_TOKEN = os.getenv("ACCESS_TOKEN")
API_URL = os.getenv("API_URL")
MONGO_URI = os.getenv("MONGO_URI")
# Treat SKIP_VERIFY_TLS as a flag: only "1"/"true"/"yes" disable verification.
# (A bare bool(os.getenv(...)) would be truthy for ANY non-empty value,
# including "false".)
SKIP_VERIFY_TLS = os.getenv("SKIP_VERIFY_TLS", "").lower() in ("1", "true", "yes")


def setup_k8s_connection():
    """Set up the Kubernetes connection using environment variables."""
    kube_config = {
        "apiVersion": "v1",
        "kind": "Config",
        "clusters": [{
            "name": CLUSTER,
            "cluster": {
                "server": API_URL,
                "insecure-skip-tls-verify": SKIP_VERIFY_TLS
            }
        }],
        "users": [{
            "name": CLUSTER,
            "user": {"token": ACCESS_TOKEN}
        }],
        "contexts": [{
            "name": CLUSTER,
            "context": {
                "cluster": CLUSTER,
                "user": CLUSTER,
                "namespace": NAMESPACE
            }
        }],
        "current-context": CLUSTER
    }

    config.load_kube_config_from_dict(kube_config)
    print(f"✓ Connected to {CLUSTER}")
    return client.CoreV1Api()


def check_k8s_connection():
    """Verify the connection by listing resources."""
    v1 = client.CoreV1Api()
    apps_v1 = client.AppsV1Api()

    print("Checking pods and deployments...")
    pods = v1.list_namespaced_pod(namespace=NAMESPACE)
    deployments = apps_v1.list_namespaced_deployment(namespace=NAMESPACE)

    print(f"Found {len(pods.items)} pods and {len(deployments.items)} deployments")
    return True


def run_cmd_on_pod(pod_name: str, namespace: str, command: list[str]):
    """Execute a command inside a pod and return its combined output."""
    v1 = client.CoreV1Api()
    result = stream(
        v1.connect_get_namespaced_pod_exec,
        pod_name,
        namespace,
        command=command,
        stderr=True,
        stdin=False,
        stdout=True,
        tty=False
    )
    return result


def copy_backup_from_pod(local_path: str = None):
    """
    Copy the compressed backup file from the pod to the local filesystem.

    Args:
        local_path: Local path to save the backup. If None, a timestamped
            filename is generated.
    """
    if local_path is None:
        timestamp = datetime.now().strftime('%Y%m%d_%H%M%S')
        local_path = f"/tmp/mongo_backup_{timestamp}.tar.gz"

    print(f"Downloading backup from pod to {local_path}")

    # Using the kubectl cp command
    pod_spec = f"{NAMESPACE}/{MONGO_POD}:/tmp/backup.tar.gz"
    cmd = ['kubectl', 'cp', pod_spec, local_path, '-n', NAMESPACE, '--retries', '10']

    result = subprocess.run(cmd, capture_output=True, text=True)

    if result.returncode != 0:
        raise Exception(f"Failed to copy backup from pod: {result.stderr}")

    print(f"✓ Backup downloaded to {local_path}")


def remove_old_backup():
    print("Removing old backups if they exist")
    run_cmd_on_pod(MONGO_POD, NAMESPACE, ["rm", "-rf", "/tmp/backup"])
    run_cmd_on_pod(MONGO_POD, NAMESPACE, ["rm", "-rf", "/tmp/backup.tar.gz"])
    print("Old backups removed")


def test_mongodb_connection():
    print("Testing MongoDB connection")
    # mongosh takes the connection string as a positional argument
    run_cmd_on_pod(MONGO_POD, NAMESPACE, ["mongosh", "--eval", "db.version()", MONGO_URI])
    print("MongoDB connection test completed")


def backup_mongodb():
    print("Running MongoDB backup")
    run_cmd_on_pod(MONGO_POD, NAMESPACE, ["mongodump", "--uri", MONGO_URI, "--out", "/tmp/backup"])
    print("MongoDB backup completed")


def compress_backup():
    print("Compressing MongoDB backup")
    run_cmd_on_pod(MONGO_POD, NAMESPACE, ["tar", "-czf", "/tmp/backup.tar.gz", "/tmp/backup"])
    print("MongoDB backup compressed")


if __name__ == "__main__":
    # Set up the connection
    setup_k8s_connection()
    check_k8s_connection()

    # Run the backup
    print("Starting MongoDB backup...")
    remove_old_backup()
    test_mongodb_connection()
    backup_mongodb()
    compress_backup()
    copy_backup_from_pod()
    print("✓ Backup complete!")
```
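After `copy_backup_from_pod()` completes, it can be worth sanity-checking the downloaded archive before relying on it. A hypothetical helper (not part of the script) using only the standard library:

```python
import tarfile

def archive_member_count(path: str) -> int:
    """Return the number of entries in a gzip'd tar archive (0 if unreadable)."""
    try:
        with tarfile.open(path, "r:gz") as tar:
            return len(tar.getnames())
    except (tarfile.TarError, OSError):
        return 0

# A non-empty mongodump backup should contain at least one BSON/metadata file,
# so a count of 0 would indicate a failed or truncated download, e.g.:
# assert archive_member_count("/tmp/mongo_backup_20250101_120000.tar.gz") > 0
```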
