Agent skill

file-processing

Data file processing utilities for CSV, JSON, and text files. Provides helpers for reading, transforming, and validating structured data.

View SKILL.md on GitHub Repository

Stars 408

Forks 47

Install this agent skill to your Project

npx add-skill https://github.com/baidu-baige/LoongFlow/tree/main/.claude/skills/file-processing

SKILL.md

File Processing Skill

This skill provides utilities and guidance for building robust file processing applications.

Purpose

Use this skill when your task involves:

Reading and parsing CSV, JSON, or text files
Data validation and cleaning
File format conversions
Batch processing of multiple files
Generating reports from data files

Key Capabilities

1. Data Reading

CSV parsing with header detection
JSON file handling (single object or array)
Text file processing line-by-line
Error handling for malformed files

2. Data Validation

Check for required fields
Validate data types
Handle missing values
Report data quality issues

3. Data Transformation

Filter rows based on conditions
Calculate statistics (sum, avg, count)
Format conversions
Data aggregation

4. Output Generation

Write processed data to new files
Generate summary reports
Create multiple output formats

Best Practices

Project Structure for File Processing

project/
├── main.py              # Entry point with CLI
├── file_reader.py       # File I/O operations
├── data_processor.py    # Core processing logic
├── validator.py         # Data validation
├── config.py            # Configuration constants
└── utils.py             # Helper functions

Error Handling Pattern

python

def read_file_safely(filepath):
    """Read file with proper error handling"""
    try:
        if not os.path.exists(filepath):
            raise FileNotFoundError(f"File not found: {filepath}")

        with open(filepath, 'r', encoding='utf-8') as f:
            return f.read()
    except Exception as e:
        print(f"Error reading file: {e}")
        return None

CSV Processing Template

python

import csv

def process_csv(input_file, output_file):
    """Process CSV with header detection"""
    with open(input_file, 'r', encoding='utf-8') as f:
        reader = csv.DictReader(f)

        processed = []
        for row in reader:
            # Transform each row
            processed_row = transform_row(row)
            processed.append(processed_row)

    # Write results
    with open(output_file, 'w', encoding='utf-8') as f:
        if processed:
            writer = csv.DictWriter(f, fieldnames=processed[0].keys())
            writer.writeheader()
            writer.writerows(processed)

JSON Processing Template

python

import json

def process_json(input_file, output_file):
    """Process JSON data"""
    with open(input_file, 'r', encoding='utf-8') as f:
        data = json.load(f)

    # Process data (handle both list and dict)
    processed = process_data(data)

    with open(output_file, 'w', encoding='utf-8') as f:
        json.dump(processed, f, indent=2, ensure_ascii=False)

Common Patterns

1. CLI with Argument Parsing

python

import argparse

def main():
    parser = argparse.ArgumentParser(description='File Processor')
    parser.add_argument('input', help='Input file path')
    parser.add_argument('output', help='Output file path')
    parser.add_argument('--format', choices=['csv', 'json'], default='csv')

    args = parser.parse_args()
    process_file(args.input, args.output, args.format)

2. Batch Processing

python

import glob

def process_directory(input_dir, output_dir, pattern='*.csv'):
    """Process all matching files in directory"""
    files = glob.glob(os.path.join(input_dir, pattern))

    for filepath in files:
        filename = os.path.basename(filepath)
        output_path = os.path.join(output_dir, f"processed_{filename}")
        process_file(filepath, output_path)

3. Progress Reporting

python

def process_with_progress(items):
    """Process items with progress feedback"""
    total = len(items)
    for i, item in enumerate(items, 1):
        process_item(item)
        print(f"Progress: {i}/{total} ({i*100//total}%)", end='\r')
    print()  # New line when complete

Tools Available

When implementing file processing tasks, you have access to:

Read - Read file contents
Write - Create new files
Edit - Modify existing files
Glob - Find files by pattern
Bash - Run shell commands (e.g., wc -l, head)

Testing Tips

Always test your file processor with:

Empty files - Should handle gracefully
Malformed data - CSV with wrong column count, invalid JSON
Missing files - Should provide clear error messages
Large files - Consider memory usage
Special characters - Unicode, newlines in CSV fields

Example Task Breakdown

Task: "Create a CSV analyzer that calculates statistics"

Suggested Steps:

Read CSV file and detect headers
Parse data into structured format
Calculate statistics (count, sum, average) per column
Generate summary report
Write results to output file

Recommended Structure:

csv_analyzer.py - Main program
stats.py - Statistics calculations
report_generator.py - Format output

References

For more complex tasks, consider:

Python's csv module for CSV handling
json module for JSON operations
pathlib for cross-platform file paths
pandas for advanced data processing (if allowed)

Maintainer

baidu-baige Core maintainer

Source details

Full Name: baidu-baige/LoongFlow
Branch: main
Path in repo: .claude/skills/file-processing
License: Apache License 2.0
Topics: agent ai llm ai-agent multi-agent genetic-algorithm discovery baidu evolutionary-algorithms evolutionary-computation evolve iterative-methods iterative-refinement llm-ensemble loongflow openevolve-alternative optimize sota

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

baidu-baige/LoongFlow

skill-creator

Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.

408 47

Explore

baidu-baige/LoongFlow

code-analysis

Code review and debugging assistant. Identifies bugs, performance issues, security vulnerabilities, and suggests optimizations.

408 47

Explore

baidu-baige/LoongFlow

loongflow

PEES (Plan-Execute-Evaluate-Summary) iterative problem-solving methodology with LoongFlow engine for complex tasks. Use when tasks need structured iteration, optimization, evolution, or when user mentions loongflow/PEES/PES.

408 47

Explore

petekp/claude-code-setup

ubiquitous-language

Extract a DDD-style ubiquitous language glossary from the current conversation, flagging ambiguities and proposing canonical terms. Saves to UBIQUITOUS_LANGUAGE.md. Use when user wants to define domain terms, build a glossary, harden terminology, create a ubiquitous language, or mentions "domain model" or "DDD".

20 6

Explore

petekp/claude-code-setup

every-style-editor

This skill should be used when reviewing or editing copy to ensure adherence to Every's style guide. It provides a systematic line-by-line review process for grammar, punctuation, mechanics, and style guide compliance.

20 6

Explore

petekp/claude-code-setup

manage-codex

Autonomous Codex batch orchestrator. Use for "/manage-codex", "manage codex", "use codex", "dispatch to codex", or long-running Codex work.

20 6

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

File Processing Skill

Purpose

Key Capabilities

1. Data Reading

2. Data Validation

3. Data Transformation

4. Output Generation

Best Practices

Project Structure for File Processing

Error Handling Pattern

CSV Processing Template

JSON Processing Template

Common Patterns

1. CLI with Argument Parsing

2. Batch Processing

3. Progress Reporting

Tools Available

Testing Tips

Example Task Breakdown

References

Recommended Agent Skills

skill-creator

code-analysis

loongflow

ubiquitous-language

every-style-editor

manage-codex