Strategic Oil & Gas Intelligence: Integrating Jewish Holidays and Lunar Cycles for Market Prediction

Publié le 13 juin 2025 par loic

Date: June 13, 2025
Category: Commodity Analysis, Geopolitical Risk
Tags: Oil Markets, Gas Trading, Geopolitical Intelligence, Risk Management

The energy markets have witnessed unprecedented volatility in recent years, with traditional analysis methods often failing to provide adequate warning of major price movements. The October 7, 2023 Hamas attack, which occurred precisely on the Jewish holiday of Simchat Torah, demonstrated how religious observances can serve as strategic timing mechanisms for geopolitical events affecting energy markets. This comprehensive analysis presents an enhanced intelligence framework that integrates Jewish holidays and lunar cycles to provide commodity specialists with 4-168 hour early warning capabilities for oil and gas market disruptions.

The Foundation: Unconventional Intelligence Indicators

The Pentagon Pizza Index Precedent

The proven effectiveness of unconventional intelligence gathering is best illustrated by the Pentagon Pizza Index, which successfully predicted recent Israeli strikes on Iran by monitoring pizza delivery spikes near defense facilities hours before military operations commenced. This method operates on a simple principle: during major geopolitical crises requiring extended work hours at defense facilities, food orders surge dramatically near key government buildings.

The concept has historical precedent, having been noted before the Grenada invasion in the 1980s, the Panama crisis in 1989, and Kuwait’s invasion when CIA pizza orders spiked the night before. The method’s effectiveness lies in its ability to detect unusual patterns in routine activities that correlate with heightened military or diplomatic activity.

Expanding Beyond Traditional Monitoring

Modern energy market intelligence requires sophisticated approaches that combine traditional economic analysis with innovative early-warning systems. Social media monitoring using artificial intelligence and natural language processing can identify emerging geopolitical topics hours before they appear on traditional news sources. Twitter-based algorithms have successfully identified geopolitical events at least a day before they became relevant on Google Trends, including missile launches and regional conflicts.

Tier 1: Jewish Holiday Intelligence Network

Simchat Torah: The Highest-Priority Indicator

Simchat Torah has emerged as the most critical indicator for geopolitical events affecting oil markets, with a reliability score of 95% and lead times of 4-8 hours. The October 7, 2023 attack was deliberately timed to coincide with this joyous Jewish festival, transforming it into what many now call the « Simchat Torah War » rather than simply referring to the calendar date.

Hamas leadership spent over two years planning this operation, specifically debating whether to conduct it on Yom Kippur or Simchat Torah, ultimately choosing the latter to maximize psychological impact on Jewish communities worldwide. The intelligence value of monitoring Simchat Torah stems from its symbolic significance as the « Joy of Torah, » making it an attractive target for adversaries seeking to inflict maximum emotional damage.

Oil prices surged 8% immediately following the October 7 attack, with crude futures initially rising 13% before stabilizing when no physical supply disruptions materialized. This pattern demonstrates how geopolitical events timed to Jewish holidays can trigger immediate risk premiums in energy markets even without actual supply chain impacts.

Yom Kippur Strategic Monitoring

Yom Kippur maintains critical importance as both the holiest day in Judaism and a historically preferred date for surprise military operations. The 1973 Yom Kippur War began on October 6, which coincided with both the Jewish Day of Atonement and the 10th day of Ramadan, demonstrating how religious calendar overlaps can amplify geopolitical risks.

The lunar phase during the 1973 attack was a new moon, providing optimal conditions for nighttime military operations while maximizing the element of surprise during religious observance. Modern intelligence frameworks must monitor increased activity patterns around Israeli defense facilities, emergency government meetings, and unusual corporate executive travel patterns 48-72 hours before Yom Kippur observance.

High Holy Days Comprehensive Framework

The Jewish High Holy Days period, encompassing Rosh Hashana through Yom Kippur, represents a 10-day window of elevated geopolitical risk requiring enhanced oil and gas market monitoring. Israeli society remains particularly vulnerable during this period, with government operations reduced and military personnel often on leave for religious observance.

Intelligence gathering during this period should focus on monitoring diaspora community travel patterns, synagogue security alerts, and changes in Israeli government operational tempo. Oil traders should implement enhanced position monitoring and volatility adjustments 48 hours before each High Holy Day, with particular attention to Middle Eastern crude benchmarks and shipping insurance premiums for vessels transiting critical chokepoints.

Tier 2: Lunar Cycle Market Intelligence

Full Moon Volatility Correlation

Statistical analysis reveals significant correlations between full moon phases and increased financial market volatility, with stock market trading volumes rising approximately 50% during full moon periods. The 2008 financial crisis and March 2020 COVID market crash both occurred near full moon phases, suggesting heightened emotional trading and increased market participation during these lunar periods.

Energy commodities demonstrate similar patterns, with crude oil futures showing increased intraday volatility ranges during full moon weeks compared to new moon periods. The October 7, 2023 attack occurred during a waning crescent moon phase (41.6% illumination), providing sufficient darkness for infiltration operations while maintaining enough visibility for coordination.

New Moon Return Patterns

Empirical research demonstrates that stock market returns during 15-day periods around new moon dates are approximately double those observed during full moon periods, with annualized differences reaching 7-10% for international markets. This « lunar cycle effect » appears strongest in emerging market economies and countries with higher baseline market volatility, suggesting cultural and psychological factors influence trading behavior beyond purely rational economic calculations.

The psychological mechanisms underlying lunar effects may stem from altered sleep patterns, increased emotional volatility, and changes in risk-taking behavior among market participants. Energy traders should consider position sizing adjustments based on lunar phase timing, particularly for short-term options strategies and volatility trading approaches.

Critical Geopolitical Chokepoint Monitoring

Strait of Hormuz Intelligence Network

With 20% of global oil flow transiting this chokepoint, implementing combined tanker AIS tracking with insurance premium monitoring provides crucial early warning capabilities. Recent tensions following Israeli strikes on Iran highlight the criticality of this route, where military exercise announcements and tanker insurance rate spikes provide 24-hour early warning of potential disruptions.

Red Sea Shipping Intelligence

Following Russian support for Houthi operations and continued attacks on commercial vessels, monitoring Houthi social media channels combined with real-time shipping delay data provides 12-hour lead times for Red Sea disruptions affecting 12% of global oil flows through the Suez Canal.

Russia-Ukraine Pipeline Monitoring

Monitoring pipeline pressure data combined with social media sentiment analysis along the Russia-Ukraine border provides early warning for routes handling 40% of European natural gas flows. Recent sabotage incidents demonstrate the vulnerability of these critical infrastructure assets to geopolitical tensions.

Advanced Maritime Intelligence Systems

Dark Shipping Detection Protocols

Implementing RF geolocation monitoring to identify vessels that disable AIS transponders, particularly around sanctioned countries like Iran and Venezuela, provides 24-hour lead times for detecting sanction evasion activities that can trigger enforcement actions affecting global oil flows.

Ship-to-Ship Transfer Surveillance

Monitoring unusual ship-to-ship transfers using satellite imagery and AIS data correlation provides early warning of sanctions circumvention activities. Recent intelligence reveals increased STS operations in the Aegean Sea for disguising Russian oil cargo origins.

Implementation Strategy and Risk Assessment

Phase 1: Immediate Implementation (0-30 days)

Deploy satellite-based gas flare detection and tanker AIS tracking systems, which provide 2-72 hour lead times with minimal implementation complexity. These systems leverage existing commercial satellite networks and maritime transponder data for immediate operational capability.

Simchat Torah alert networks should integrate Israeli community observance tracking with diaspora synagogue security reports to identify unusual activity patterns preceding potential incidents. Emergency meeting detection systems must monitor corporate calendar changes among energy sector executives, which historically provide 12-24 hour warning of significant market-moving events.

Phase 2: Advanced Capabilities (30-90 days)

Integrate social media sentiment analysis with corporate calendar monitoring to detect insider knowledge indicators. Focus on energy sector professional networks and executive communication patterns for early warning of strategic decisions.

Lunar phase correlation analysis requires integration of astronomical data with historical volatility patterns to establish predictive models for energy market behavior. Religious calendar overlap tracking becomes critical during periods when Jewish holidays coincide with Islamic observances, creating compounded risks for Middle Eastern stability and oil supply security.

Phase 3: Comprehensive Network (90+ days)

Establish automated correlation systems linking multiple indicator streams for predictive modeling. Combine weather-driven volatility forecasting with geopolitical risk assessment for comprehensive market intelligence.

Full moon volatility models require extensive backtesting against historical energy market data to establish reliable correlation thresholds and trading signal generation. The comprehensive system targets 85-90% accuracy for combined indicator alerts, with potential cost savings of $50-100 million per major event avoided through improved early warning capabilities.

Strategic Recommendations for Energy Market Participants

Immediate Risk Management Protocols

Energy traders should implement automated volatility adjustments and enhanced position monitoring beginning 48 hours before major Jewish holidays, with particular focus on Simchat Torah, Yom Kippur, and High Holy Days periods. Lunar phase tracking should inform medium-term position sizing decisions, with reduced leverage during full moon periods when market volatility typically increases.

Supply chain managers must build strategic reserves before high-risk religious observance periods and establish alternative routing protocols for shipments transiting Middle Eastern chokepoints during sensitive dates. Corporate risk management frameworks should integrate religious calendar data with traditional geopolitical intelligence gathering, ensuring decision-makers receive adequate warning of potential market disruptions.

Long-Term Strategic Implementation

The integration of religious and lunar intelligence requires dedicated analytical resources with deep cultural expertise in Jewish, Islamic, and lunar calendar systems. Investment in satellite monitoring capabilities for oil storage facilities and shipping chokepoints provides objective validation of activity level changes that supplement human intelligence gathering.

Regional energy security planning must account for the symbolic significance of religious observances in strategic decision-making by state and non-state actors. The enhanced intelligence framework provides competitive advantages through improved early warning capabilities, but requires careful operational security to prevent adversaries from adapting their timing strategies in response to known monitoring capabilities.

Conclusion

The integration of Jewish holidays and lunar cycles into oil and gas market intelligence represents a paradigm shift from traditional economic analysis toward comprehensive geopolitical risk assessment. The October 7, 2023 Hamas attack and subsequent market reactions validate the importance of religious timing considerations in modern energy market dynamics.

Success in implementing these unconventional intelligence methods requires combining high-automation satellite systems with human-analyzed social intelligence for comprehensive coverage. Energy market participants who adopt these enhanced intelligence frameworks will gain significant competitive advantages through improved early warning capabilities and more effective risk management during periods of heightened geopolitical tension.

The current volatile geopolitical environment, characterized by ongoing Middle Eastern conflicts and global supply chain vulnerabilities, makes these advanced intelligence capabilities essential for energy sector success. Organizations that fail to integrate these unconventional indicators into their risk management frameworks risk being consistently surprised by market events that more sophisticated intelligence systems can anticipate with 85-95% accuracy.

Disclaimer: This analysis is for informational purposes only and should not be considered as investment advice. Energy markets are subject to significant volatility and risk, and past performance does not guarantee future results.

Sources : Loic

Maximizing Your Claude Max Subscription: Complete Guide to Automated Workflows with Claude Code and Windsurf

Publié le 27 mai 2025 par loic

The Claude Max plan at $100 per month has revolutionized how developers can integrate Claude’s powerful AI capabilities directly into their development workflow. With the recent integration of Claude Code into the Max subscription, users can now access terminal-based AI assistance without burning through expensive API tokens. This comprehensive guide shows you how to set up a complete development environment using Windsurf, Claude Code, and your Claude Max subscription, including advanced automation workflows that maximize productivity.

Understanding the Claude Max Plan Value Proposition

The $100 monthly Claude Max plan provides 5x more usage than Claude Pro, translating to approximately 225 messages every 5 hours. This expanded capacity makes it ideal for developers who need sustained AI assistance throughout their coding sessions without constantly hitting usage limits.

What makes this plan particularly attractive is the inclusion of Claude Code at no additional cost. Previously, using Claude Code required separate API tokens, but as of May 2025, Max plan subscribers can use Claude Code directly through their subscription.

Setting Up Claude Code with Your Max Subscription

Installation and Authentication

Getting started with Claude Code on your Max plan is straightforward. First, install Claude Code following the official documentation, then authenticate using only your Max plan credentials.

The key is ensuring you’re using your Max subscription rather than API credits:claude logout claude login

During the login process, authenticate with the same credentials you use for claude.ai and decline any API credit options when prompted. This ensures Claude Code draws exclusively from your Max plan allocation.

Avoiding API Credit Prompts

One crucial aspect of staying within your $100 monthly budget is preventing Claude Code from defaulting to API credits when you approach your usage limits. Configure your setup to avoid these prompts entirely by:

Using only Max plan credentials during authentication
Declining API credit options when they appear
Monitoring your usage with the /status command

Integrating Claude Code with Windsurf via MCP

Windsurf’s Model Context Protocol (MCP) support allows you to create a seamless bridge between Claude Code and your IDE. This integration transforms Claude Code into an MCP server that Windsurf can call upon for complex coding tasks.

MCP Configuration

Create or modify your mcp_config.json file in Windsurf’s configuration directory:

macOS: ~/.codeium/windsurf/mcp_config.json
Windows: %APPDATA%\Codeium\windsurf\mcp_config.json
Linux: ~/.config/.codeium/windsurf/mcp_config.json

Add this configuration:{ "mcpServers": { "claude-code": { "command": "claude", "args": ["mcp", "serve"], "env": {} } } }

Starting the MCP Server

Launch Claude Code as an MCP server directly from your terminal:claude mcp serve

This command transforms Claude Code into a service that Windsurf can interact with programmatically, providing access to Claude’s coding capabilities through the MCP protocol.

Creating Custom Workflows for Automatic Task Delegation

With Claude Code accessible via MCP, you can create sophisticated custom workflows that automatically delegate specific types of tasks to Claude Code. This automation maximizes your productivity while staying within your Max plan limits.

Setting Up Workflow Infrastructure

Windsurf’s Wave 8 update introduced Custom Workflows, which allows you to define shared slash commands that can automate repetitive tasks. Start by creating the workflow directory structure:mkdir -p .windsurf/workflows mkdir -p .windsurf/rules

Complex Refactoring Operations Workflow

Create .windsurf/workflows/refactor.md:# Complex Refactoring Workflow ## Trigger When user requests complex refactoring operations involving multiple files or architectural changes ## Action Use claude_code tool with the following prompt template:

Your work folder is {PROJECT_PATH}

TASK TYPE: Complex Refactoring
TASK ID: refactor-{TIMESTAMP}

CONTEXT:

Target files: {TARGET_FILES}
Refactoring goal: {REFACTORING_GOAL}
Constraints: {CONSTRAINTS}

INSTRUCTIONS:

Analyze current code structure and dependencies
Create refactoring plan with step-by-step approach
Execute refactoring while maintaining functionality
Run tests to verify changes
Update documentation if needed

COMPLETION CRITERIA:

All tests pass
Code follows project conventions
No breaking changes introduced

## Parameters - TARGET_FILES: List of files to refactor - REFACTORING_GOAL: Description of desired outcome - CONSTRAINTS: Any limitations or requirements

Documentation Generation Workflow

Create .windsurf/workflows/docs.md:

# Documentation Generation Workflow ## Trigger When user requests documentation generation for code, APIs, or project structure ## Action Use claude_code tool with documentation-specific prompt:

Your work folder is {PROJECT_PATH}

TASK TYPE: Documentation Generation
TASK ID: docs-{TIMESTAMP}

CONTEXT:

Documentation type: {DOC_TYPE}
Target audience: {AUDIENCE}
Output format: {FORMAT}

INSTRUCTIONS:

Analyze codebase structure and functionality
Generate comprehensive documentation following project standards
Include code examples and usage patterns
Create or update README, API docs, or inline comments
Ensure documentation is up-to-date with current implementation

DELIVERABLES:

Generated documentation files
Updated existing documentation
Code comments where appropriate

## Parameters - DOC_TYPE: API, README, inline comments, etc. - AUDIENCE: developers, end-users, maintainers - FORMAT: Markdown, JSDoc, Sphinx, etc.

Code Review and Analysis Workflow

Create .windsurf/workflows/code-review.md:

# Code Review and Analysis Workflow ## Trigger When user requests code review, security audit, or quality analysis ## Action Use claude_code tool with analysis-specific prompt:

Your work folder is {PROJECT_PATH}

TASK TYPE: Code Review and Analysis
TASK ID: review-{TIMESTAMP}

CONTEXT:

Review scope: {REVIEW_SCOPE}
Focus areas: {FOCUS_AREAS}
Standards: {CODING_STANDARDS}

INSTRUCTIONS:

Perform comprehensive code analysis
Check for security vulnerabilities
Evaluate performance implications
Assess code maintainability
Verify adherence to coding standards
Generate detailed report with recommendations

DELIVERABLES:

Code quality assessment
Security vulnerability report
Performance optimization suggestions
Refactoring recommendations

## Parameters - REVIEW_SCOPE: specific files, modules, or entire codebase - FOCUS_AREAS: security, performance, maintainability, etc. - CODING_STANDARDS: project-specific or industry standards

Architecture Planning Workflow

Create .windsurf/workflows/architecture.md:

# Architecture Planning Workflow ## Trigger When user requests system design, architecture review, or structural planning ## Action Use claude_code tool with architecture-specific prompt:

Your work folder is {PROJECT_PATH}

TASK TYPE: Architecture Planning
TASK ID: arch-{TIMESTAMP}

CONTEXT:

Project scope: {PROJECT_SCOPE}
Requirements: {REQUIREMENTS}
Constraints: {CONSTRAINTS}
Technology stack: {TECH_STACK}

INSTRUCTIONS:

Analyze current architecture (if existing)
Identify architectural patterns and best practices
Design scalable and maintainable structure
Create component diagrams and documentation
Provide implementation roadmap
Consider performance and security implications

DELIVERABLES:

Architecture documentation
Component diagrams
Implementation plan
Technology recommendations

## Parameters - PROJECT_SCOPE: feature, module, or entire system - REQUIREMENTS: functional and non-functional requirements - CONSTRAINTS: budget, timeline, technology limitations - TECH_STACK: current or preferred technologies

Implementing Automatic Task Delegation

File-Based Rules Configuration

Create intelligent delegation rules based on file types and project context. Create .windsurf/rules/delegation.md:

# Automatic Delegation Rules ## File Type Rules - **/*.py, **/*.js, **/*.ts: Complex operations → Claude Code - **/*.md, **/*.rst: Documentation tasks → Claude Code - **/*.json, **/*.yaml: Configuration analysis → Claude Code ## Task Complexity Rules - Multi-file operations → Always delegate to Claude Code - Single file edits < 50 lines → Use native Windsurf - Architectural changes → Always delegate to Claude Code - Performance optimization → Always delegate to Claude Code ## Project Size Rules - Large projects (>1000 files) → Delegate complex operations - Medium projects (100-1000 files) → Delegate multi-file operations - Small projects (<100 files) → Selective delegation

Smart Delegation Configuration

Create .windsurf/workflows/smart-delegation.json:

{ "delegationRules": { "triggers": [ { "keywords": ["refactor", "restructure", "reorganize", "optimize"], "action": "delegate_to_claude_code", "workflow": "refactor", "priority": "high" }, { "keywords": ["document", "docs", "documentation", "readme"], "action": "delegate_to_claude_code", "workflow": "docs", "priority": "medium" }, { "keywords": ["review", "analyze", "audit", "check"], "action": "delegate_to_claude_code", "workflow": "code-review", "priority": "high" }, { "keywords": ["architecture", "design", "structure", "plan"], "action": "delegate_to_claude_code", "workflow": "architecture", "priority": "high" } ], "fileTypeRules": { "*.py": "Use claude_code for Python-specific operations", "*.js": "Use claude_code for complex JavaScript refactoring", "*.ts": "Use claude_code for TypeScript architectural changes", "*.md": "Use claude_code for documentation generation" }, "complexityThresholds": { "high": "Automatically use claude_code with detailed prompts", "medium": "Offer claude_code as option with user confirmation", "low": "Use native Windsurf capabilities" } } }

Advanced Workflow Patterns

Boomerang Pattern Implementation

Implement the Boomerang pattern where Windsurf orchestrates complex tasks and delegates subtasks to Claude Code:# .windsurf/workflows/boomerang-orchestration.md

## Parent Task Orchestration Pattern ### Flow Structure 1. **Task Analysis**: Windsurf analyzes complex user request 2. **Subtask Breakdown**: Generate specific Claude Code prompts 3. **Parallel Delegation**: Send subtasks to Claude Code via MCP 4. **Result Integration**: Combine Claude Code outputs intelligently 5. **Quality Assurance**: Validate integrated solution 6. **Final Delivery**: Present unified solution to user ### Example Implementation User Request: "Optimize our API performance and add comprehensive monitoring" **Windsurf Orchestration:** - Performance analysis → Claude Code (workflow: code-review) - Database optimization → Claude Code (workflow: refactor) - Caching implementation → Claude Code (workflow: architecture) - Monitoring setup → Claude Code (workflow: architecture) - Documentation update → Claude Code (workflow: docs) **Integration Phase:** - Combine optimization recommendations - Ensure compatibility between changes - Create unified implementation plan - Generate comprehensive documentation

Multiple Cascades for Parallel Processing

Leverage Windsurf’s simultaneous cascades for parallel workflow execution:# Example parallel workflow execution /architecture-review --async --project-scope=backend /refactor-components --async --target=frontend /update-docs --async --doc-type=api /security-audit --async --scope=authentication

Context-Aware Delegation System

Create an intelligent system that automatically determines when to delegate tasks:// .windsurf/workflows/intelligent-delegation.js

Maximizing Your Development Workflow

Strategic Usage Patterns

With approximately 225 messages every 5 hours on the $100 Max plan, strategic usage becomes important. Consider these approaches:

High-Value Delegation: Reserve Claude Code for tasks where it provides the most value:

Complex multi-file refactoring operations
Comprehensive code analysis and security audits
Architecture planning and system design
Documentation generation for large codebases

Efficient Batching: Group related tasks to maximize context utilization:

Combine refactoring with documentation updates
Pair code review with optimization recommendations
Bundle architecture planning with implementation guidance

Queue-Based Workflow Management

Implement a queue system for managing multiple workflows:# Queue multiple tasks for efficient processing windsurf queue add refactor --files="src/components/*.js" --goal="performance" windsurf queue add docs --type="api" --format="openapi" windsurf queue add review --scope="security" --focus="authentication" # Process queue efficiently windsurf queue process --batch-size=3 --use-claude-code

Hybrid Development Strategy

The most effective approach combines multiple tools strategically:

Windsurf’s native AI for quick queries, simple edits, and general assistance
Claude Code via MCP for complex operations, architectural decisions, and comprehensive analysis
Direct claude.ai access for research, planning, and brainstorming sessions
Automated workflows for repetitive tasks and standardized processes

Monitoring and Optimization

Usage Tracking and Management

Keep track of your consumption and optimize usage patterns:# Monitor Claude Code usage claude status # Track workflow effectiveness windsurf workflows stats --period=week # Analyze delegation patterns windsurf analyze delegation-effectiveness --export=csv

Workflow Performance Analytics

Create a monitoring system for your automated workflows:# .windsurf/monitoring/workflow-metrics.md

## Key Performance Indicators - **Task Success Rate**: Percentage of workflows completing successfully - **Time to Completion**: Average time for each workflow type - **Usage Efficiency**: Claude messages per completed task - **User Satisfaction**: Quality rating of workflow outputs ## Optimization Triggers - Success rate < 85% → Review and refine workflow prompts - Completion time > expected → Optimize task breakdown - Usage efficiency declining → Improve prompt specificity - User satisfaction < 4/5 → Gather feedback and iterate

Continuous Improvement Process

Implement a systematic approach to workflow optimization:

Weekly Review: Analyze workflow performance metrics
Monthly Optimization: Update prompts and delegation rules based on data
Quarterly Assessment: Evaluate overall strategy effectiveness
User Feedback Integration: Regularly collect and incorporate user feedback

Cost-Effectiveness Analysis

At $100 per month, the Claude Max plan with automated workflows offers exceptional value:

Direct Cost Savings:

Eliminates API token costs for Claude Code usage
Predictable monthly expenses for budgeting
No surprise billing from heavy usage periods

Productivity Multipliers:

Automated task delegation reduces manual workflow management
Parallel processing capabilities increase throughput
Intelligent delegation ensures optimal tool usage for each task

Quality Improvements:

Consistent workflow execution reduces human error
Standardized prompts ensure reliable output quality
Comprehensive automation covers more aspects of development

Advanced Integration Possibilities

Team Collaboration Workflows

Extend your automation to support team development:# .windsurf/workflows/team-collaboration.md

## Shared Workflow Standards - Consistent code review processes across team members - Standardized documentation generation - Unified architecture decision processes - Collaborative refactoring workflows ## Team-Specific Configurations - Role-based workflow access (senior dev, junior dev, architect) - Project-specific delegation rules - Shared workflow templates and best practices - Cross-team workflow sharing and reuse

CI/CD Integration

Integrate your workflows with continuous integration:# .github/workflows/claude-code-automation.yml

name: Automated Code Quality with Claude Code on: pull_request: branches: [ main, develop ] jobs: claude-code-review: runs-on: ubuntu-latest steps: - uses: actions/checkout@v3 - name: Setup Claude Code run: | # Setup Claude Code with Max subscription claude login --token=${{ secrets.CLAUDE_MAX_TOKEN }} - name: Automated Code Review run: | windsurf workflow execute code-review \ --scope="changed-files" \ --format="github-comment" \ --auto-comment=true

Troubleshooting Common Issues

Delegation Failures

When workflows fail to delegate properly:

Check MCP Connection: Verify Claude Code MCP server is running
Validate Credentials: Ensure Max subscription authentication is active
Review Workflow Syntax: Check workflow definition files for errors
Monitor Usage Limits: Verify you haven’t exceeded your 5-hour allocation

Performance Optimization

If workflows are running slowly:

Optimize Prompts: Make prompts more specific and focused
Reduce Context Size: Break large tasks into smaller, focused subtasks
Parallel Processing: Use multiple cascades for independent tasks
Cache Results: Store frequently used outputs to avoid regeneration

Conclusion

The Claude Max plan at $100 per month, combined with automated workflows in Windsurf and Claude Code integration, creates a powerful development environment that maximizes AI assistance while maintaining cost control. By implementing the comprehensive workflow automation described in this guide, developers can:

Achieve 3-5x productivity gains through intelligent task delegation
Maintain predictable costs without API token concerns
Ensure consistent quality through standardized automated processes
Scale development practices across teams and projects

This setup represents the future of AI-assisted development: seamless integration, intelligent automation, and powerful capabilities that enhance rather than replace developer expertise. The key to success lies in proper configuration, strategic usage patterns, and continuous optimization of your automated workflows.

With these elements in place, your $100 monthly investment in Claude Max becomes a force multiplier for your development productivity, providing enterprise-level AI assistance with the reliability and predictability that professional development teams require.

The automated workflow system described here transforms Claude Code from a simple terminal tool into an intelligent development partner that understands your project context, anticipates your needs, and delivers consistent, high-quality results across all aspects of your development process.

My Vibe coding Rules: Critical Thinking Enhancement System

Publié le 25 mai 2025 par loic

Core Configuration

Version: v6
Project Type: web_application
Code Style: clean_and_maintainable
Environment Support: dev, test, prod
Thinking Mode: critical_analysis_enabled

Critical Thinking Rules for Development

1. The Assumption Detector

ALWAYS ask before implementing: « What hidden assumptions am I making about this code/architecture? What evidence might contradict my current approach? »

2. The Devil’s Advocate

Before major implementations: « If you were trying to convince me this is a terrible approach, what would be your strongest arguments? »

3. The Ripple Effect Analyzer

For architectural changes: « Beyond the obvious first-order effects, what second or third-order consequences should I consider in this codebase? »

4. The Blind Spot Illuminator

When debugging persists: « I keep experiencing [problem] despite trying [solution attempts]. What factors might I be missing? »

5. The Status Quo Challenger

For legacy code decisions: « We’ve always used [current approach], but it’s not working. Why might this method be failing, and what radical alternatives could work better? »

6. The Clarity Refiner

When requirements are unclear: « I’m trying to make sense of [topic or technical dilemma]. Can you help me clarify what I’m actually trying to figure out? »

7. The Goal Realignment Check

During development sprints: « I’m currently working toward [goal]. Does this align with what I truly value, or am I chasing the wrong thing? »

8. The Fear Dissector

When hesitating on technical decisions: « I’m hesitating because I’m afraid of [fear]. Is this fear rational? What’s the worst that could realistically happen? »

9. The Feedback Forager

For fresh perspective: « Here’s what I’ve been thinking: . What would someone with a very different technical background say about this? »

10. The Tradeoff Tracker

For architectural decisions: « I’m choosing between [option A] and [option B]. What are the hidden costs and benefits of each that I might not be seeing? »

11. The Progress Checker

For development velocity: « Over the past [time period], I’ve been working on [habit/goal]. Based on my current actions, am I on track or just spinning my wheels? »

12. The Values Mirror

When feeling disconnected from work: « Lately, I’ve felt out of sync. What personal values might I be neglecting or compromising right now? »

13. The Time Capsule Test

For major technical decisions: « If I looked back at this decision a year from now, what do I hope I’ll have done—and what might I regret? »

Test-Driven Development (TDD) Rules

Write tests first before any production code.
Apply Rule 1 (Assumption Detector) before writing tests: « What assumptions am I making about this feature’s requirements? »
Use Rule 2 (Devil’s Advocate) on test design: « How could these tests fail to catch real bugs? »
Run tests before implementing new functionality.
Write the minimal code required to pass tests.
Apply Rule 13 (Time Capsule Test) before refactoring: « Will this refactor make the code more maintainable in a year? »
Do not start new tasks until all tests are passing.
Place all tests in a dedicated /tests directory.
Explain why tests will initially fail before implementation.
Propose an implementation strategy using Rule 6 (Clarity Refiner) before writing code.

Code Quality Standards with Critical Analysis

Maximum file length: 300 lines (apply Rule 4 if constantly hitting this limit).
Use Rule 5 (Status Quo Challenger) when following existing patterns that seem problematic.
Apply Rule 10 (Tradeoff Tracker) for architectural decisions between maintainability vs. performance.
Implement proper error handling using Rule 8 (Fear Dissector) to identify real vs. imagined failure scenarios.
Use Rule 3 (Ripple Effect Analyzer) before major refactoring efforts.
Add explanatory comments when necessary, but question with Rule 1: « Am I assuming this code is self-explanatory when it’s not? »

AI Assistant Critical Thinking Behavior

Apply Rule 6 (Clarity Refiner) to understand requirements before proceeding.
Use Rule 9 (Feedback Forager) by asking clarifying questions when requirements are ambiguous.
Apply Rule 2 (Devil’s Advocate) to proposed solutions before implementation.
Use Rule 4 (Blind Spot Illuminator) when debugging complex issues.
Apply Rule 11 (Progress Checker) to ensure solutions actually solve the core problem.

Critical Thinking Implementation Strategy

Pre-Development Analysis

Apply Rules 1, 6, 7 before starting any new feature
Use Rule 13 for architectural decisions that will impact the project long-term

During Development

Invoke Rule 4 when stuck on implementation details
Apply Rule 3 before making changes that affect multiple files
Use Rule 11 to assess if current approach is actually working

Code Review Process

Apply Rule 2 to all proposed changes
Use Rule 10 to evaluate different implementation approaches
Invoke Rule 9 to get perspective on code clarity and maintainability

Debugging Sessions

Start with Rule 4 to identify overlooked factors
Use Rule 5 to challenge assumptions about existing debugging approaches
Apply Rule 1 to question what you think you know about the bug

Meta-Rule for Complex Problems

When facing complex technical challenges, combine multiple critical thinking rules:

« I want to examine [technical problem/architectural decision] under all angles. Help me apply the Assumption Detector, Devil’s Advocate, and Ripple Effect Analyzer to ensure I’m making the best technical decision possible. »

Workflow Best Practices Enhanced with Critical Thinking

Planning & Task Management

Apply Rule 7 (Goal Realignment Check) when updating PLANNING.md
Use Rule 11 (Progress Checker) when reviewing TASK.md milestones
Invoke Rule 6 (Clarity Refiner) for ambiguous requirements

Architecture Decisions

Always apply Rule 10 (Tradeoff Tracker) for technology choices
Use Rule 13 (Time Capsule Test) for decisions affecting long-term maintainability
Apply Rule 3 (Ripple Effect Analyzer) before major structural changes

Code Review & Refactoring

Use Rule 2 (Devil’s Advocate) on all proposed changes
Apply Rule 5 (Status Quo Challenger) to legacy code patterns
Invoke Rule 1 (Assumption Detector) during code reviews

Verification Rule Enhanced

I am an AI coding assistant that strictly adheres to Test-Driven Development (TDD) principles, high code quality standards, and critical thinking methodologies. I will:

Apply critical thinking rules before, during, and after development tasks
Write tests first using assumption detection and devil’s advocate analysis
Question my own proposed solutions using multiple perspective analysis
Challenge existing patterns when they may be causing problems
Analyze ripple effects of architectural decisions
Maintain awareness of hidden assumptions and blind spots
Regularly assess progress and goal alignment
Consider long-term implications of technical decisions
Seek diverse perspectives on complex problems
Balance rational analysis with intuitive concerns

This enhanced system transforms every coding decision into a multi-perspective analysis to avoid costly mistakes and improve solution quality.

La Méthode des Trois Sphères : Une Approche Intégrée pour le Développement d’Applications par IA

Publié le 27 avril 2025 par loic

Une Méthodologie Structurée pour l’Ère de l’IA

Dans le paysage actuel du développement d’applications, une approche méthodique et structurée est essentielle pour transformer efficacement les idées en produits fonctionnels. La Méthode des Trois Sphères offre un cadre complet qui guide chaque étape du processus de développement, de la conceptualisation initiale à l’implémentation technique détaillée.

Sphère 1: Définition du Produit & Fondation Architecturale

Cette première phase établit les bases solides sur lesquelles reposera tout le projet:

Inputs requis:

Concept initial de l’application
Public cible et problématique à résoudre
Contraintes commerciales et techniques

Processus:

Définir clairement l’objectif principal du projet
Créer des personas utilisateur détaillés
Développer un argumentaire commercial (pitch)
Établir les exigences commerciales fondamentales
Identifier les fonctionnalités clés avec leurs objectifs
Ébaucher l’architecture technique globale
Définir les sous-objectifs mesurables

Outputs:

Document de vision du produit
Spécifications des exigences commerciales
Architecture préliminaire avec cartographie des fonctionnalités
Critères de réussite du projet

Sphère 2: Conception UX & Expansion des Fonctionnalités

Cette phase développe l’expérience utilisateur et approfondit chaque fonctionnalité:

Inputs requis:

Documents de la Sphère 1
Références de design et contraintes de marque
Comportements utilisateur attendus

Processus:

Élaborer plusieurs options de design pour l’application
Concevoir spécifiquement pour les personas identifiés
Détailler chaque fonctionnalité avec ses objectifs, relations et dépendances
Spécifier les besoins en API pour chaque fonctionnalité
Documenter les workflows d’expérience utilisateur
Créer une structure de composants et d’interaction
Détailler les exigences de données et de sécurité par fonctionnalité

Outputs:

Documentation UI/UX complète
Maquettes ou wireframes conceptuels
Spécifications détaillées des fonctionnalités
Modèles d’interaction et patterns de design
Documentation des flux utilisateur

Sphère 3: Planification Technique & Spécifications d’Implémentation

Cette phase finale transforme la vision en plan d’action concret:

Inputs requis:

Tous les documents des sphères précédentes
Contraintes techniques et stack technologique préférée
Ressources disponibles pour le développement

Processus:

Définir l’architecture logicielle détaillée
Établir les patterns architecturaux à utiliser
Spécifier les routes API et endpoints
Concevoir la structure de la base de données
Décomposer chaque fonctionnalité en tâches granulaires
Spécifier les fichiers à créer ou modifier et comment
Créer un plan d’implémentation étape par étape
Documenter les meilleures pratiques pour le développement
Établir les stratégies de test et de déploiement

Outputs:

Spécifications techniques complètes
Documentation API détaillée
Plan de développement actionnable
Liste de tâches priorisées pour l’implémentation
Documentation sur la stack technique et le flux de données

Avantages de la Méthode des Trois Sphères

Cette approche structurée offre plusieurs avantages significatifs:

Couverture complète du cycle de développement: De la conception initiale à l’implémentation technique
Équilibre entre vision commerciale, expérience utilisateur et faisabilité technique
Structure évolutive adaptée aux projets de toutes tailles
Documentation progressive où chaque phase alimente la suivante
Prévention proactive des problèmes avant le début du codage
Compatibilité avec différents modèles d’IA comme Claude, GPT, O3 Mini High ou DeepSeek
Clarté dans la communication entre toutes les parties prenantes du projet

Conseils d’Implémentation

Privilégiez la phase de conception aussi longtemps que nécessaire avant de commencer à coder
Conservez toute la documentation au format markdown pour une référence facile pendant le développement
Pour les premiers projets, laissez l’IA suggérer des recommandations plutôt que d’être trop directif
Considérez ces documents comme des ressources vivantes à affiner au cours du développement
Utilisez des outils comme Cursor AI, Windsurf ou Github Copilot pour implémenter le plan détaillé
Revoyez systématiquement chaque sphère avant de passer à la suivante

Cette méthodologie des Trois Sphères représente une approche complète qui transforme une idée initiale en un plan d’action détaillé, offrant une structure claire tout en permettant la flexibilité nécessaire pour s’adapter aux spécificités de chaque projet.

The Future of Retrieval: A Fusion of ColBERT, LightRAG, and RAPTOR

Publié le 17 avril 2025 par loic

In the evolving landscape of information retrieval and AI-powered search, three innovative approaches have emerged as game-changers: ColBERT, LightRAG, and RAPTOR. Each brings unique strengths to the table, but their true potential lies in fusion—combining these technologies to create a retrieval system greater than the sum of its parts. Let’s explore these models and how their integration can revolutionize information retrieval.

ColBERT: Contextual Precision at the Token Level

ColBERT (Contextualized Late Interaction over BERT) represents a significant advancement in neural information retrieval. Unlike traditional retrieval methods that compress entire documents into single vectors, ColBERT preserves the contextual representation of each token in both queries and documents.

What makes ColBERT special is its « late interaction » mechanism. Rather than computing a single similarity score between query and document vectors, ColBERT calculates fine-grained interactions between each query token and document token. This approach allows for more precise matching, especially for queries containing specific terms or phrases.

The beauty of ColBERT lies in its ability to balance the precision of exact matching with the contextual understanding of neural models. When a user searches for specific technical terms or rare phrases, ColBERT can identify the exact matches while still understanding their context within documents.

LightRAG: Graph-Based Knowledge Navigation

LightRAG takes a fundamentally different approach by leveraging graph structures to represent knowledge. Think of it as creating a map of information where entities (like concepts, people, or objects) are connected through meaningful relationships.

The « Light » in LightRAG refers to its streamlined architecture compared to more complex graph-based retrieval systems. It focuses on three core elements: entities, relations, and the graph itself. This simplification makes it more efficient while maintaining powerful retrieval capabilities.

What sets LightRAG apart is its dual-level retrieval paradigm. When processing a query, it first identifies relevant entities and then navigates the connections between them. This allows the system to follow logical paths through information—much like how humans make connections between related concepts.

For example, if you’re researching climate change impacts on agriculture, LightRAG might connect entities like « rising temperatures, » « crop yields, » and « food security » even if they don’t appear together in the same document. This ability to bridge information gaps makes LightRAG particularly powerful for complex queries requiring multi-hop reasoning.

RAPTOR: Hierarchical Understanding Through Recursive Abstraction

RAPTOR (Recursive Abstractive Processing for Tree-Organized Retrieval) approaches information organization from yet another angle—hierarchical abstraction. It builds a tree-like structure of information at varying levels of detail, from specific facts to broad concepts.

The process begins by breaking documents into small chunks and embedding them using semantic models. These chunks are then clustered based on similarity, and a language model generates concise summaries for each cluster. This process repeats recursively, creating higher-level summaries until a comprehensive hierarchical structure emerges.

What makes RAPTOR powerful is its ability to maintain both breadth and depth of understanding. When responding to a query, it can navigate this tree structure to find the appropriate level of detail—providing broad context when needed or drilling down to specific facts.

This hierarchical approach is particularly valuable for complex topics where understanding requires both the big picture and specific details. For instance, when researching a medical condition, RAPTOR can provide both high-level overviews of treatment approaches and specific details about particular medications.

The Power of Fusion: Creating a Hybrid Retrieval System

While each of these approaches offers significant advantages, their true potential emerges when combined into a hybrid system. Here’s how these technologies complement each other:

Complementary Strengths

ColBERT excels at precise token-level matching, making it ideal for queries requiring exact phrase matching or specific terminology.

LightRAG shines in connecting related information across documents, enabling multi-hop reasoning and bridging knowledge gaps.

RAPTOR provides hierarchical context, allowing the system to understand both broad themes and specific details within a topic.

How Fusion Works

A fused retrieval system leverages all three approaches in parallel, then combines their results through a sophisticated ranking algorithm. Here’s a conceptual workflow:

Query Processing: When a user submits a query, it’s processed simultaneously by all three systems.
Multi-faceted Retrieval:

ColBERT identifies documents with precise token-level matches
LightRAG navigates entity relationships to find connected information
RAPTOR traverses its hierarchical structure to retrieve relevant summaries and details

Result Fusion: The results from each system are combined using a weighted fusion algorithm that considers:

The confidence score from each retrieval method
The diversity of information provided
The complementary nature of the retrieved content

Contextual Ranking: The final ranking considers not just relevance to the query, but also how pieces of information complement each other to provide a comprehensive answer.

Real-world Benefits

This fusion approach addresses the limitations of individual retrieval methods:

Improved Recall: By leveraging multiple retrieval strategies, the system captures relevant information that might be missed by any single approach.
Enhanced Precision: The combination of ColBERT’s token-level precision with the contextual understanding of RAPTOR and LightRAG’s relational awareness leads to more accurate results.
Contextual Depth: The system can provide both broad overviews and specific details, adapting to the user’s information needs.
Complex Query Handling: Multi-hop questions that require connecting information across documents become manageable through LightRAG’s graph traversal capabilities.

The Future of Retrieval

As we look ahead, this fusion of ColBERT, LightRAG, and RAPTOR represents the cutting edge of retrieval technology. The approach moves beyond simple keyword matching or even pure semantic search to create a more human-like understanding of information—one that recognizes precise details, understands relationships between concepts, and grasps both the forest and the trees.

For enterprises dealing with vast knowledge bases, research institutions navigating complex scientific literature, or content platforms seeking to enhance user experience, this hybrid approach offers a powerful solution that mimics human information processing while leveraging the speed and scale of modern computing.

The future of retrieval isn’t about choosing between these approaches—it’s about bringing them together in harmony to create systems that truly understand the complexity and interconnectedness of human knowledge.

Loic Baconnier

Creating an MCP Server from Any FastAPI URL with One Prompt

Publié le 30 mars 2025 par loic

In the rapidly evolving landscape of AI-assisted development, the Model Context Protocol (MCP) has emerged as a game-changer. But what if you want to connect your AI assistants to existing FastAPI applications without modifying their code? Today, I’ll show you how to create an automatic MCP server from any FastAPI URL using just one prompt in Cursor.

The Power of FastAPI’s OpenAPI Documentation

FastAPI automatically generates comprehensive OpenAPI (formerly Swagger) documentation for all endpoints. This documentation contains everything needed to understand and interact with the API:

Endpoint paths and HTTP methods
Request parameters and body schemas
Response formats and status codes
Detailed descriptions and examples

This rich metadata is exactly what we need to create an MCP server that can proxy requests to the original API.

The One-Prompt Solution

Copy and paste this prompt into Cursor to generate a complete, ready-to-run MCP server that connects to any FastAPI application:

Create a complete Python script that generates an MCP server from the FastAPI application running at {URL}. The script should:

1. Fetch the OpenAPI/Swagger documentation from {URL}/openapi.json
2. Analyze all endpoints, parameters, request bodies, and response models
3. Create a new FastAPI application that:
   - Mirrors all the endpoints from the original API
   - Forwards requests to the original API
   - Returns responses from the original API
4. Add MCP server functionality using the fastapi_mcp library
5. Include proper error handling for:
   - Connection issues
   - Authentication failures
   - Invalid responses

The final script should be a single, self-contained Python file that:
- Takes command line arguments for customization (port, authentication, etc.)
- Includes detailed comments explaining how it works
- Can be run directly with "python script.py" to start the MCP server
- Automatically connects to {URL} and creates an MCP server at http://localhost:8000/mcp

Replace {URL} with the actual URL of the FastAPI application, for example https://api.example.com.

The output should be ONLY the complete Python script, ready to run, with no explanations before or after the code.

How to Use This Prompt

Replace {URL} with the actual URL of the FastAPI application you want to connect to

For example: https://api.example.com or http://localhost:8000

Paste the prompt into Cursor or another AI coding assistant
Copy the generated Python script and save it as mcp_bridge.py
Run the script with Python:

python mcp_bridge.py

Connect your AI assistant to the MCP server at http://localhost:8000/mcp

That’s it! No manual coding, no configuration files, no complex setup. Just one prompt and you have a fully functional MCP server that connects to any FastAPI application.

What Makes This Approach Special

This solution is unique because:

It requires zero knowledge of MCP or FastAPI – the AI does all the work
It works with any FastAPI application that has OpenAPI documentation enabled
It preserves all the original API’s functionality including parameters, schemas, and documentation
It creates a production-ready MCP server with proper error handling and logging
It’s completely automated – no manual intervention required

Real-World Applications

This approach opens up exciting possibilities:

Connect AI assistants to your company’s internal APIs without modifying them
Create MCP bridges to public APIs that use FastAPI
Test MCP functionality before implementing it directly in your codebase
Provide AI access to legacy systems through a FastAPI proxy

Conclusion

The ability to create MCP servers from existing FastAPI URLs with just one prompt is a game-changer for AI-assisted development. You can now connect your favorite AI assistants to any FastAPI application in minutes, without writing a single line of code yourself.

Try this approach today and experience the power of combining FastAPI’s excellent documentation with the flexibility of the Model Context Protocol!

Loic Baconnier

Automate MCP Integration in Your FastAPI App with a Single Copy/Paste

Publié le 30 mars 2025 par loic

Modern APIs need more than just endpoints—they require robust documentation, strong typing, and seamless integration with advanced AI assistants. In our fast-paced development environment, every minute counts. That’s why today we’re exploring how to leverage a single, well-crafted Cursor prompt to automatically refactor an existing FastAPI application and integrate the Model Context Protocol (MCP) with zero extra manual adjustments.

What Is MCP and Why Does It Matter?

MCP (Model Context Protocol) is a lightweight framework that enables AI assistants to interact with your APIs. By converting your API endpoints into well-documented, standardized MCP tools, AI models (like those running in Cursor or Claude 3.7 Sonnet) can automatically discover and call your API functions. This not only enhances interoperability but also allows for dynamic, natural-language-driven interactions with your app.

Why Improve Your FastAPI Code First?

Before unlocking the power of MCP, your API needs to be in top shape. This means:

Comprehensive docstrings for each endpoint.
Detailed type hints and Pydantic models for requests and responses.
Robust error handling with proper HTTP exceptions.
Clear descriptions for every route so that MCP can easily « discover » and interpret them.

By improving your code according to these best practices, you’re ensuring that the MCP integration can accurately reflect your API’s capabilities, leading to smarter, reliable interactions for AI assistants.

Automating Everything with a Cursor Prompt

Imagine being able to improve your code—and add a whole new MCP interface—to your FastAPI project by simply pasting one prompt into Cursor. No more manual tweaks or back-and-forth adjustments. The idea is to use a precise instruction that tells the AI exactly how to:

Refactor your existing code for better quality.
Automatically insert MCP integration using the fastapi_mcp library.
Generate the final, runnable code along with testing and configuration instructions.

Here’s why this approach is so powerful:

It removes the need for manual intervention.
It standardizes your API transformation process.
It sparks creativity by letting the AI fill in the boilerplate, making your API production-ready with minimal hassle.
It works with non-perfect AI systems by laying out each necessary step, ensuring no detail is lost.

The Final Cursor Prompt

Copy and paste the following prompt directly into Cursor. This instruction tells the AI to first improve your existing FastAPI code with best practices and then add the MCP route using the fastapi_mcp library—all in one go:

I have an existing FastAPI application that is functional but not optimized. Your job is to improve the code and integrate MCP (Model Context Protocol) using the fastapi_mcp library. Follow these steps carefully:

### Step 1: Improve the Existing FastAPI Code
1. **Docstrings**: Add detailed docstrings to all endpoints. Each docstring should include:
   - A brief description of what the endpoint does.
   - Parameters with their types and descriptions.
   - The response format, including success and error cases.
   - HTTP status codes used by the endpoint.
2. **Type Hints**: Ensure all functions have proper type hints for parameters and return values.
3. **Pydantic Models**:
   - Define Pydantic models for request bodies (if any).
   - Use Pydantic models for response validation (`response_model` in FastAPI).
4. **Error Handling**:
   - Use `HTTPException` with appropriate status codes for errors.
   - Handle edge cases gracefully with meaningful error messages.
5. **Endpoint Descriptions**: Add a `description` parameter to each route decorator to describe what the endpoint does.

### Step 2: Integrate MCP
1. Install the `fastapi_mcp` library:
   ```
   pip install fastapi_mcp
   ```
2. Import the necessary function:
   ```
   from fastapi_mcp import add_mcp_server
   ```
3. Add MCP functionality to the FastAPI app:
   - After initializing your `FastAPI` app, call `add_mcp_server()`.
   - Mount the MCP server at `/mcp`.
   - Use a descriptive name for your MCP server (e.g., "My API MCP").
4. Ensure that all existing endpoints remain functional after adding the MCP server.

### Step 3: Provide Testing Instructions
1. Generate a JSON configuration snippet to connect this MCP server in Cursor:
   ```
   {
     "mcpServers": {
       "My API MCP": {
         "url": "http://127.0.0.1:8000/mcp"
       }
     }
   }
   ```
2. Provide a sample `curl` command to test the `/mcp` endpoint:
   ```
   curl -X POST http://127.0.0.1:8000/mcp/tools
   ```

### Input Example
Here is an example of my current FastAPI code (simplified):
```
from fastapi import FastAPI, HTTPException

app = FastAPI()

@app.get("/items")
def get_items():
    return {"items": []}

@app.get("/items/{item_id}")
def get_item(item_id: int):
    if item_id == 0:
        raise HTTPException(status_code=404, detail="Item not found")
    return {"item_id": item_id}
```

### Output Requirements
- Refactor the above code to follow best practices (as outlined in Step 1).
- Add MCP integration (as described in Step 2).
- Provide a complete, runnable code block with comments explaining each change.
- Include testing instructions (as described in Step 3).

The final output should look like this:
1. The improved and MCP-integrated code.
2. A JSON snippet for connecting this API as an MCP server in Cursor.
3. A sample `curl` command to test the `/mcp` route.

DO NOT skip any steps or provide vague explanations—output only complete, ready-to-use code.

How This Works

By pasting the above prompt into Cursor, you delegate the entire transformation process to the AI assistant. It will:

Refactor your code to meet professional standards.
Automatically insert the MCP integration using fastapi_mcp.
Produce a self-contained code snippet with detailed comments and testing instructions.

This means you can convert an imperfect API into a fully MCP-compliant service without directly writing additional code!

Conclusion

This method not only accelerates your development process but also minimizes human error by standardizing integration tasks. With one thoughtfully constructed prompt, you can harness the power of AI to bring your FastAPI application up to production level—complete with modern documentation and remote AI assistant compatibility via the MCP protocol.

Try it out in your next project and experience a new level of automation that allows you to focus on what matters most: building innovative features while letting the AI take care of the boilerplate.

Loic Baconnier

AI Coding Assistant Rules for Windsurf and Cursor

Publié le 11 mars 2025 par loic

These optimized rules will transform how Windsurf and Cursor AI work with your Python backend and Next.js frontend projects. By adding these configurations, you’ll get more accurate, consistent code suggestions that follow best practices and avoid common AI-generated code issues.

How to Implement in Windsurf

Option 1 – File Method:

Create a file named .windsurfrules in your project’s root directory
Copy and paste the entire code block below into this file
Save the file

Option 2 – Settings Method:

Open Windsurf AI
Navigate to Settings > Set Workspace AI Rules > Edit Rules
Paste the entire code block below
Save your settings

How to Implement in Cursor

Option 1 – File Method:

Create a file named .cursorrules in your project’s root directory
Copy and paste the same code block below (it works for both platforms)
Save the file

Option 2 – Settings Method:

Open Cursor AI
Click on your profile picture in the bottom left
Select « Settings »
Navigate to « AI » section
Find « Custom Instructions » and click « Edit »
Paste the entire code block below
Click « Save »

After Implementation

Restart your AI coding assistant or reload your workspace
The AI will now follow these comprehensive rules in all your coding sessions
You should immediately notice more relevant, project-specific code suggestions

These rules will significantly improve how your AI coding assistant understands your project requirements, coding standards, and technical preferences. You’ll get more relevant suggestions, fewer hallucinated functions, and code that better integrates with your existing codebase.

# Cursor Rules and Workflow Guide

## Core Configuration

- **Version**: `v5`
- **Project Type**: `web_application`
- **Code Style**: `clean_and_maintainable`
- **Environment Support**: `dev`, `test`, `prod`

---

## Test-Driven Development (TDD) Rules

1. Write tests **first** before any production code.
2. Run tests before implementing new functionality.
3. Write the **minimal code** required to pass tests.
4. Refactor only after all tests pass.
5. Do not start new tasks until all tests are passing.
6. Place all tests in a dedicated `/tests` directory.
7. Explain why tests will initially fail before implementation.
8. Propose an implementation strategy before writing code.
9. Check for existing functionality before creating new features.

---

## Code Quality Standards

- Maximum file length: **300 lines** (split into modules if needed).
- Follow existing patterns and project structure.
- Write modular, reusable, and maintainable code.
- Implement proper error handling mechanisms.
- Use type hints and annotations where applicable.
- Add explanatory comments when necessary.
- Avoid code duplication; reuse existing functionality if possible.
- Prefer simple solutions over complex ones.
- Keep the codebase clean and organized.

---

## AI Assistant Behavior

1. Explain understanding of requirements before proceeding with tasks.
2. Ask clarifying questions when requirements are ambiguous or unclear.
3. Provide complete, working solutions for each task or bug fix.
4. Focus only on relevant areas of the codebase for each task.
5. Debug failing tests with clear explanations and reasoning.

---

## Things to Avoid

- Never generate incomplete or partial solutions unless explicitly requested.
- Never invent nonexistent functions, APIs, or libraries.
- Never ignore explicit requirements or provided contexts.
- Never overcomplicate simple tasks or solutions.
- Never overwrite `.env` files without explicit confirmation.

---

## Implementation Guidelines

### General Rules

1. Always check for existing code before creating new functionality.
2. Avoid major changes to patterns unless explicitly instructed or necessary for bug fixes.

### Environment-Specific Rules

- Mock data should only be used for **tests**, never for development or production environments.

### File Management

- Avoid placing scripts in files if they are intended to be run only once.

### Refactoring Rules

1. Refactor files exceeding **300 lines** to improve readability and maintainability.

### Bug Fixing Rules

1. Exhaust all options using existing patterns and technologies before introducing new ones.
2. If introducing a new pattern, remove outdated implementations to avoid duplicate logic.

---

## Workflow Best Practices

### Planning \& Task Management

1. Use Markdown files (`PLANNING.md`, `TASK.md`) to manage project scope and tasks:
- **PLANNING.md**: High-level vision, architecture, constraints, tech stack, tools, etc.
- **TASK.md**: Tracks current tasks, backlog, milestones, and discovered issues during development.
2. Always update these files as the project progresses:
- Mark completed tasks in `TASK.md`.
- Add new sub-tasks or TODOs discovered during development.

### Code Structure \& Modularity

1. Organize code into clearly separated modules grouped by feature or responsibility.
2. Use consistent naming conventions and file structures as described in `PLANNING.md`.
3. Never create a file longer than 500 lines of code; refactor into modules if necessary.

### Testing \& Reliability

1. Create unit tests for all new features (functions, classes, routes, etc.).
2. Place all tests in a `/tests` folder mirroring the main app structure:
- Include at least:
- 1 test for expected use,
- 1 edge case,
- 1 failure case (to ensure proper error handling).
3. Mock external services (e.g., databases) in tests to avoid real-world interactions.

### Documentation \& Explainability

1. Update `README.md` when adding features, changing dependencies, or modifying setup steps.
2. Write docstrings for every function using Google-style formatting:

```python
def example(param1: int) -&gt; str:
"""
Brief summary of the function.

Args:
param1 (int): Description of the parameter.

Returns:
str: Description of the return value.
"""
```

3. Add inline comments explaining non-obvious logic and reasoning behind decisions.

---

## Verification Rule

I am an AI coding assistant that strictly adheres to Test-Driven Development (TDD) principles and high code quality standards. I will:

1. Write tests **first** before any production code.
2. Place all tests in a dedicated `/tests` directory.
3. Explain why tests initially fail before implementation begins.
4. Write minimal production code to pass the tests.
5. Refactor while maintaining passing tests at all times.
6. Enforce a maximum file length of **300 lines** per file (or 500 lines if specified).
7. Check for existing functionality before writing new code or features.
8. Explain my understanding of requirements before starting implementation work.
9. Ask clarifying questions when requirements are ambiguous or unclear.
10. Propose implementation strategies before writing any production code.
11. Debug failing tests with clear reasoning and explanations provided step-by-step.

---

## Server Management Best Practices

1. Restart servers after making changes to test them properly (only when necessary).
2. Kill all related servers from previous testing sessions to avoid conflicts.

---

## Modular Prompting Process After Initial Prompt

When interacting with the AI assistant:

1. Focus on one task at a time for consistent results:
- Good Example: “Update the list records function to add filtering.”
- Bad Example: “Update list records, fix API key errors in create row function, and improve documentation.”
2. Always test after implementing every feature to catch bugs early:
- Create unit tests covering:
- Successful scenarios,
- Edge cases,
- Failure cases.

---

These rules combine best practices for Python backend and Next.js frontend development with your specific coding patterns, workflow preferences, and technical stack requirements. The configuration instructs Windsurf AI to maintain clean, modular code that follows established patterns while avoiding common pitfalls in AI-assisted development.

Loic Baconnier

See also https://github.com/bacoco/awesome-cursorrules from
PatrickJS/awesome-cursorrules

Enhancing Document Retrieval with Topic-Based Chunking and RAPTOR

Publié le 11 mars 2025 par loic

In the evolving landscape of information retrieval, combining topic-based chunking with hierarchical retrieval methods like RAPTOR represents a significant advancement for handling complex, multi-topic documents. This article explores how these techniques work together to create more effective document understanding and retrieval systems.

Topic-Based Chunking: Understanding Document Themes

Topic-based chunking segments text by identifying and grouping content related to specific topics, creating more semantically meaningful chunks than traditional fixed-size approaches. This method is particularly valuable for multi-topic documents where maintaining thematic coherence is essential.

The TopicNodeParser in LlamaIndex provides an implementation of this approach:

It analyzes documents to identify natural topic boundaries
It segments text based on semantic similarity rather than arbitrary token counts
It preserves the contextual relationships between related content

After processing documents with TopicNodeParser, you can extract the main topics from each node using an LLM. This creates a comprehensive topic map of your document collection, which serves as the foundation for more sophisticated retrieval.

RAPTOR: Hierarchical Retrieval for Complex Documents

RAPTOR (Recursive Abstractive Processing for Tree Organized Retrieval) builds on chunked documents by organizing information in a hierarchical tree structure through recursive clustering and summarization. This approach outperforms traditional retrieval methods by preserving document relationships and providing multiple levels of abstraction.

Choosing the Right RAPTOR Method

RAPTOR offers two primary retrieval methods, each with distinct advantages for different use cases:

Tree Traversal Retrieval navigates the hierarchical structure sequentially, starting from root nodes and moving down through relevant branches. This method is ideal for:

Getting comprehensive overviews of multiple documents
Understanding the big picture before exploring details
Queries requiring progressive exploration from general to specific information
Press reviews or reports where logical flow between concepts is important

Collapsed Tree Retrieval flattens the tree structure, evaluating all nodes simultaneously regardless of their position in the hierarchy. This method excels at:

Complex multi-topic queries requiring information from various levels
Situations needing both summary-level and detailed information simultaneously
Multiple recall scenarios where information is scattered across documents
Syndicate press reviews with multiple intersecting topics

Research has shown that the collapsed tree method consistently outperforms traditional top-k retrieval, achieving optimal results when searching for the top 20 nodes containing up to 2,000 tokens. For most multi-document scenarios with diverse topics, the collapsed tree approach is generally superior.

Creating Interactive Topic-Based Summaries

The final piece of an effective document retrieval system is interactive topic-based summarization, which allows users to explore document collections at varying levels of detail.

An interactive topic-based summary:

Presents topics hierarchically, showing their development throughout documents
Allows users to expand or collapse sections based on interest
Provides contextual placement of topics within the overall document structure
Uses visual cues like indentation, bullets, or font changes to indicate hierarchy

This approach transforms complex summarization results into comprehensible visual summaries that help users navigate large text collections more effectively.

Implementing a Complete Pipeline

A comprehensive implementation combines these techniques into a seamless pipeline:

Topic Identification: Use TopicNodeParser to segment documents into coherent topic-based chunks
Topic Extraction: Apply an LLM to identify and name the main topics in each chunk
Hierarchical Organization: Process these topic-based chunks with RAPTOR to create a multi-level representation
Retrieval Optimization: Select the appropriate RAPTOR method based on your specific use case
Interactive Summary: Create an interactive interface that allows users to explore topics at multiple levels of detail

This pipeline ensures that no topics are lost during processing while providing users with both high-level overviews and detailed information when needed.

Conclusion

The combination of topic-based chunking, RAPTOR’s hierarchical retrieval, and interactive summarization represents a powerful approach for handling complex, multi-topic document collections. By preserving the semantic structure of documents while enabling flexible retrieval at multiple levels of abstraction, these techniques significantly enhance our ability to extract meaningful information from large text collections.

As these technologies continue to evolve, we can expect even more sophisticated approaches to document understanding and retrieval that will further transform how we interact with textual information.

Loic Baconnier

Introducing Chonkie: The Lightweight RAG Chunking Library

Publié le 26 janvier 2025 par loic

Meet Chonkie, a revolutionary new Python library that’s transforming the way we handle text chunking for RAG (Retrieval-Augmented Generation) applications. This lightweight powerhouse combines simplicity with performance, making it an essential tool for AI developers[3].

Key Features

Core Capabilities

Feature-rich implementation with comprehensive chunking methods
Lightning-fast performance with minimal resource requirements
Universal tokenizer support for maximum flexibility[3]

Chunking Methods
The library offers multiple specialized chunkers:

TokenChunker for fixed-size token splits
WordChunker for word-based divisions
SentenceChunker for sentence-level processing
RecursiveChunker for hierarchical text splitting
SemanticChunker for similarity-based chunking
SDPMChunker utilizing Semantic Double-Pass Merge[3]

Implementation

Getting started with Chonkie is straightforward. Here’s a basic example:

from chonkie import TokenChunker
from tokenizers import Tokenizer

# Initialize tokenizer
tokenizer = Tokenizer.from_pretrained(« gpt2 »)

# Create chunker chunker = TokenChunker(tokenizer)

# Process text
chunks = chunker(« Woah! Chonkie, the chunking library is so cool! »)

# Access results for chunk in chunks: print(f »Chunk: {chunk.text} ») print(f »Tokens: {chunk.token_count} »)

Performance Metrics

The library demonstrates impressive performance:

Default installation size: 11.2MB
Token chunking speed: 33x faster than alternatives
Sentence chunking: 2x performance improvement
Semantic chunking: 2.5x speed increase[3]

Installation Options

Two installation methods are available:

# Minimal installation

pip install chonkie

# Full installation with all features

pip install chonkie[all]

Also Semantic Chunkers

Semantic Chunkers is a multi-modal chunking library for intelligent chunking of text, video, and audio. It makes your AI and data processing more efficient and accurate.

https://github.com/aurelio-labs/semantic-chunkers?tab=readme-ov-file

Sources
[1] activity https://github.com/chonkie-ai/chonkie/activity
[2] Activity · chonkie-ai/chonkie https://github.com/chonkie-ai/chonkie/activity
[3] chonkie/README.md at main · chonkie-ai/chonkie https://github.com/chonkie-ai/chonkie/blob/main/README.md