Mobile MCP

Mobile MCP

Platform-agnostic server for scalable mobile automation and development.

2,436
Stars
217
Forks
2,436
Watchers
32
Issues
Mobile MCP is a Model Context Protocol (MCP) server that enables scalable automation and interaction with native iOS and Android devices through a unified, platform-independent API. Designed to power agents and LLMs, it supports both simulator/emulator and real device environments, allowing access via structured accessibility snapshots or coordinate-based actions. The server facilitates multi-step user journeys, data extraction, and agent-based frameworks without requiring device-specific expertise.

Key Features

Unified platform-agnostic interface for mobile devices
Supports both iOS and Android (simulators, emulators, real devices)
Structured accessibility tree and snapshot support
Screenshot-based coordinate action fallback
Enables LLM and agent-driven interaction
Visual analysis of device screens
Scripted flow and form automation
Deterministic tool application
No dependency on device-specific knowledge
Agent-to-agent communication for automation

Use Cases

Automating mobile app testing workflows
Scripted multi-step user journey execution for data-entry
Driving mobile UI automation with language models
Extracting structured data from app interfaces
DIY robotic process automation for mobile environments
Seamless agent-driven operation of iOS and Android apps
Cross-device interactions for QA testing
Coordinating interactions between multiple automation agents
Accessibility-centric app analysis and interaction
Remote control and automation of real or virtual mobile devices

README

Mobile Next - MCP server for Mobile Development and Automation | iOS, Android, Simulator, Emulator, and Real Devices

This is a Model Context Protocol (MCP) server that enables scalable mobile automation, development through a platform-agnostic interface, eliminating the need for distinct iOS or Android knowledge. You can run it on emulators, simulators, and real devices (iOS and Android). This server allows Agents and LLMs to interact with native iOS/Android applications and devices through structured accessibility snapshots or coordinate-based taps based on screenshots.

https://github.com/user-attachments/assets/c4e89c4f-cc71-4424-8184-bdbc8c638fa1

πŸš€ Mobile MCP Roadmap: Building the Future of Mobile

Join us on our journey as we continuously enhance Mobile MCP! Check out our detailed roadmap to see upcoming features, improvements, and milestones. Your feedback is invaluable in shaping the future of mobile automation.

πŸ‘‰ Explore the Roadmap

Main use cases

How we help to scale mobile automation:

  • πŸ“² Native app automation (iOS and Android) for testing or data-entry scenarios.
  • πŸ“ Scripted flows and form interactions without manually controlling simulators/emulators or real devices (iPhone, Samsung, Google Pixel etc)
  • 🧭 Automating multi-step user journeys driven by an LLM
  • πŸ‘† General-purpose mobile application interaction for agent-based frameworks
  • πŸ€– Enables agent-to-agent communication for mobile automation usecases, data extraction

Main Features

  • πŸš€ Fast and lightweight: Uses native accessibility trees for most interactions, or screenshot based coordinates where a11y labels are not available.
  • πŸ€– LLM-friendly: No computer vision model required in Accessibility (Snapshot).
  • 🧿 Visual Sense: Evaluates and analyses what’s actually rendered on screen to decide the next action. If accessibility data or view-hierarchy coordinates are unavailable, it falls back to screenshot-based analysis.
  • πŸ“Š Deterministic tool application: Reduces ambiguity found in purely screenshot-based approaches by relying on structured data whenever possible.
  • πŸ“Ί Extract structured data: Enables you to extract structred data from anything visible on screen.

πŸ”§ Available MCP Tools

For detailed implementation and parameter specifications, see src/server.ts

Device Management

  • mobile_list_available_devices - List all available devices (simulators, emulators, and real devices)
  • mobile_get_screen_size - Get the screen size of the mobile device in pixels
  • mobile_get_orientation - Get the current screen orientation of the device
  • mobile_set_orientation - Change the screen orientation (portrait/landscape)

App Management

  • mobile_list_apps - List all installed apps on the device
  • mobile_launch_app - Launch an app using its package name
  • mobile_terminate_app - Stop and terminate a running app
  • mobile_install_app - Install an app from file (.apk, .ipa, .app, .zip)
  • mobile_uninstall_app - Uninstall an app using bundle ID or package name

Screen Interaction

  • mobile_take_screenshot - Take a screenshot to understand what's on screen
  • mobile_save_screenshot - Save a screenshot to a file
  • mobile_list_elements_on_screen - List UI elements with their coordinates and properties
  • mobile_click_on_screen_at_coordinates - Click at specific x,y coordinates
  • mobile_double_tap_on_screen - Double-tap at specific coordinates
  • mobile_long_press_on_screen_at_coordinates - Long press at specific coordinates
  • mobile_swipe_on_screen - Swipe in any direction (up, down, left, right)

Input & Navigation

  • mobile_type_keys - Type text into focused elements with optional submit
  • mobile_press_button - Press device buttons (HOME, BACK, VOLUME_UP/DOWN, ENTER, etc.)
  • mobile_open_url - Open URLs in the device browser

Platform Support

  • iOS: Simulators and real devices via native accessibility and WebDriverAgent
  • Android: Emulators and real devices via ADB and UI Automator
  • Cross-platform: Unified API works across both iOS and Android

πŸ—οΈ Mobile MCP Architecture

πŸ“š Wiki page

More details in our wiki page for setup, configuration and debugging related questions.

Installation and configuration

Standard config works in most of the tools:

json
{
  "mcpServers": {
    "mobile-mcp": {
      "command": "npx",
      "args": ["-y", "@mobilenext/mobile-mcp@latest"]
    }
  }
}

To setup Cline, just add the json above to your MCP settings file.

More in our wiki

Use the Claude Code CLI to add the Mobile MCP server:

bash
claude mcp add mobile-mcp -- npx -y @mobilenext/mobile-mcp@latest

Click the button to install:

Or install manually:

Go to Cursor Settings -> MCP -> Add new MCP Server. Name to your liking, use command type with the command npx -y @mobilenext/mobile-mcp@latest. You can also verify config or add command like arguments via clicking Edit.

Use the Gemini CLI to add the Mobile MCP server:

bash
gemini mcp add mobile-mcp npx -y @mobilenext/mobile-mcp@latest

Click the button to install:

Install in Goose

Or install manually:

Go to Advanced settings -> Extensions -> Add custom extension. Name to your liking, use type STDIO, and set the command to npx -y @mobilenext/mobile-mcp@latest. Click "Add Extension".

Open Qodo Gen chat panel in VSCode or IntelliJ β†’ Connect more tools β†’ + Add new MCP β†’ Paste the standard config above.

Click Save.

Read more in our wiki! πŸš€

πŸ› οΈ How to Use πŸ“

After adding the MCP server to your IDE/Client, you can instruct your AI assistant to use the available tools. For example, in Cursor's agent mode, you could use the prompts below to quickly validate, test and iterate on UI intereactions, read information from screen, go through complex workflows. Be descriptive, straight to the point.

✨ Example Prompts

Workflows

You can specifiy detailed workflows in a single prompt, verify business logic, setup automations. You can go crazy:

Search for a video, comment, like and share it.

Find the video called " Beginner Recipe for Tonkotsu Ramen" by Way of
Ramen, click on like video, after liking write a comment " this was
delicious, will make it next Friday", share the video with the first
contact in your whatsapp list.

Download a successful step counter app, register, setup workout and 5-star the app

Find and Download a free "Pomodoro" app that has more than 1k stars.
Launch the app, register with my email, after registration find how to
start a pomodoro timer. When the pomodoro timer started, go back to the
app store and rate the app 5 stars, and leave a comment how useful the
app is.

Search in Substack, read, highlight, comment and save an article

Open Substack website, search for "Latest trends in AI automation 2025",
open the first article, highlight the section titled "Emerging AI trends",
and save article to reading list for later review, comment a random
paragraph summary.

Reserve a workout class, set timer

Open ClassPass, search for yoga classes tomorrow morning within 2 miles,
book the highest-rated class at 7 AM, confirm reservation,
setup a timer for the booked slot in the phone

Find a local event, setup calendar event

Open Eventbrite, search for AI startup meetup events happening this
weekend in "Austin, TX", select the most popular one, register and RSVP
yes to the event, setup a calendar event as a reminder.

Check weather forecast and send a Whatsapp/Telegram/Slack message

Open Weather app, check tomorrow's weather forecast for "Berlin", and
send the summary via Whatsapp/Telegram/Slack to contact "Lauren Trown",
thumbs up their response.
  • Schedule a meeting in Zoom and share invite via email
Open Zoom app, schedule a meeting titled "AI Hackathon" for tomorrow at
10AM with a duration of 1 hour, copy the invitation link, and send it via
Gmail to contacts "team@example.com".

More prompt examples can be found here.

Prerequisites

What you will need to connect MCP with your agent and mobile devices:

Simulators, Emulators, and Real Devices

When launched, Mobile MCP can connect to:

  • iOS Simulators on macOS/Linux
  • Android Emulators on Linux/Windows/macOS
  • iOS or Android real devices (requires proper platform tools and drivers)

Make sure you have your mobile platform SDKs (Xcode, Android SDK) installed and configured properly before running Mobile Next Mobile MCP.

Running in "headless" mode on Simulators/Emulators

When you do not have a real device connected to your machine, you can run Mobile MCP with an emulator or simulator in the background.

For example, on Android:

  1. Start an emulator (avdmanager / emulator command).
  2. Run Mobile MCP with the desired flags

On iOS, you'll need Xcode and to run the Simulator before using Mobile MCP with that simulator instance.

  • xcrun simctl list
  • xcrun simctl boot "iPhone 16"

Thanks to all contributors ❀️

We appreciate everyone who has helped improve this project.

Star History

Star History Chart

Repository Owner

mobile-next
mobile-next

Organization

Repository Details

Language TypeScript
Default Branch main
Size 2,252 KB
Contributors 18
License Apache License 2.0
MCP Verified Nov 12, 2025

Programming Languages

TypeScript
94.86%
JavaScript
5.14%

Tags

Topics

agent android emulator ios mcp mobile physical real simulator

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

We respect your privacy. Unsubscribe at any time.

Related MCPs

Discover similar Model Context Protocol servers

  • AutoMobile

    AutoMobile

    Powerful tools for mobile automation, test authoring, and device management via MCP.

    AutoMobile provides a comprehensive set of tools for mobile automation, focusing on UI testing and development workflow automation. It operates as an MCP Server, enabling a robust interaction loop for model-driven actions and observations. The solution supports Android platforms with features like automated test authoring, multi-device management, and seamless CI test execution. AutoMobile also offers source mapping and deep view hierarchy analysis to enhance code rendering accuracy.

    • ⭐ 63
    • MCP
    • zillow/auto-mobile
  • HarmonyOS MCP Server

    HarmonyOS MCP Server

    Enables HarmonyOS device manipulation via the Model Context Protocol.

    HarmonyOS MCP Server provides an MCP-compatible server that allows programmatic control of HarmonyOS devices. It integrates with tools and frameworks such as OpenAI's openai-agents SDK and LangGraph to facilitate LLM-powered automation workflows. The server supports execution through standard interfaces and can be used with agent platforms to process natural language instructions for device actions. Its design allows for seamless interaction with HarmonyOS systems using the Model Context Protocol.

    • ⭐ 25
    • MCP
    • XixianLiang/HarmonyOS-mcp-server
  • ScreenPilot

    ScreenPilot

    Empower LLMs with full device control through screen automation.

    ScreenPilot provides an MCP server interface to enable large language models to interact with and control graphical user interfaces on a device. It offers a comprehensive toolkit for screen capture, mouse control, keyboard input, scrolling, element detection, and action sequencing. The toolkit is suitable for automation, education, and experimentation, allowing AI agents to perform complex operations on a user’s device.

    • ⭐ 50
    • MCP
    • Mtehabsim/ScreenPilot
  • omniparser-autogui-mcp

    omniparser-autogui-mcp

    Automated GUI analysis and interaction via the Model Context Protocol.

    omniparser-autogui-mcp is an MCP server that leverages OmniParser to analyze on-screen content and perform automated GUI operations. It integrates with clients such as Claude Desktop and can be configured via a detailed environment setup. The tool supports Windows and can delegate OmniParser processing to external devices, offering flexibility for complex contexts. Multiple environment variables allow customization of backend processing, target window selection, and communication methods, including SSE.

    • ⭐ 58
    • MCP
    • NON906/omniparser-autogui-mcp
  • JADX-AI-MCP

    JADX-AI-MCP

    Automated AI-powered APK analysis via Model Context Protocol.

    JADX-AI-MCP is a fully automated server and plugin for integrating Model Context Protocol (MCP) with JADX for the purpose of analyzing Android APKs using large language models such as Claude. It streamlines vulnerability discovery, reverse engineering, and static analysis by leveraging LLMs in conjunction with established tools. The project facilitates real-time code review and efficient collaboration between AI and human analysts.

    • ⭐ 637
    • MCP
    • zinja-coder/jadx-ai-mcp
  • Xcode MCP Server

    Xcode MCP Server

    Comprehensive Xcode integration server for AI assistants using the Model Context Protocol.

    Xcode MCP Server provides an MCP-compliant interface for AI agents to interact with Xcode projects on macOS. It supports project management, simulator control, CocoaPods and Swift Package Manager integration, and advanced file and build operations. Enhanced error handling and multi-project support enable seamless automation and context management for complex Xcode workflows.

    • ⭐ 330
    • MCP
    • r-huijts/xcode-mcp-server
  • Didn't find tool you were looking for?

    Be as detailed as possible for better results