Case Study - Cross-Repository Retrieval Infrastructure with Enforced Access Boundaries

Architecting a cross-repository retrieval infrastructure that unifies Box, Google Drive, and SharePoint data sources with enforced access boundaries to analyze years of historical grant data for a philanthropic foundation.

Client
Palm Health Foundation
Year
Service
Retrieval Infrastructure, Data Access Controls, Cross-Platform Integration

System Architecture Snapshot

  • Data Layer — Box, Google Drive, SharePoint connectors; document chunking and embedding
  • Retrieval Layer — RAG pipeline with Gemini (Vertex AI) for long-context analysis
  • Control Layer — Authentication, programmatic user provisioning, source attribution enforcement
  • Interface Layer — Conversational query interface with chat history continuity

The Challenge

Palm Health Foundation — a philanthropic organization funding mini grants for individuals, small nonprofits, and small businesses — had years of historical mini grant data scattered across multiple storage platforms: Box, Google Drive, and OneDrive/SharePoint.

This included Excel tracking spreadsheets, instructional documents, templates, interview summaries with project directors, and physical sign-in sheets.

The foundation wanted to identify themes and patterns across years of grant activities to inform future funding decisions. But with data siloed across three platforms and stored in varied formats, there was no efficient way to analyze it holistically.

The Solution

BeeNex engineered an internal retrieval application that connects to all three data repositories and enables foundation staff to ask natural language questions about their historical grant data.

The system:

  • Connects to Box, Google Drive, and OneDrive/SharePoint to ingest and index grant documents
  • Uses Gemini (Vertex AI) for long-context understanding, chosen for its speed and cost-effectiveness with large document sets
  • Provides a conversational interface where staff ask questions like "What themes emerged across 2022 mini grants?" or "Which grantees focused on youth mental health?"
  • Maintains chat history so users can pick up and continue conversations
  • Includes authentication with programmatic user creation for foundation staff

Key Deliverables

  • Private AI chat web application deployed to GCP
  • Multi-source data connectors (Box, Google Drive, SharePoint)
  • RAG pipeline for document retrieval and analysis
  • Authentication system with email-based registration
  • Chat history and conversation continuity
  • Onboarding guide, staff training, and walkthrough
  • Flutter
  • Firebase Hosting
  • Python
  • Google Cloud Platform
  • Gemini (Vertex AI)
  • RAGManaged
  • Box API
  • Google Drive API
  • SharePoint API
  • Cloud Firestore

The foundation didn't need AI to write grants — they needed AI to read years of them and surface what mattered. That's the system we deployed.

BeeNex Engineering Team,

More case studies

Full-Stack Edge AI Product Engineering — Hardware to Market

Engineering the complete product stack for a revolutionary edge AI device — from hardware-software integration and firmware optimization to product site, investor materials, and go-to-market infrastructure.

Read more

AI Boundary Control Layer for Production SaaS Platform

Engineering a constraint-layer architecture for a production SaaS platform — integrating retrieval, permissioned agent workflows, and cross-environment deployment constraints within a live production ecosystem.

Read more

Deploy AI as infrastructure — not experiment.

30-minute architectural review. Direct. Structured. No pitch deck.

Our Office

  • Melbourne, FL
    2412 Irwin St
    Melbourne, FL 32901