Mobile MCP

Platform-agnostic server for scalable mobile automation and development.


2,436 stars · 217 forks · 2,436 watchers

Mobile MCP is a Model Context Protocol (MCP) server that enables scalable automation and interaction with native iOS and Android devices through a unified, platform-independent API. Designed to power agents and LLMs, it supports both simulator/emulator and real device environments, allowing access via structured accessibility snapshots or coordinate-based actions. The server facilitates multi-step user journeys, data extraction, and agent-based frameworks without requiring device-specific expertise.

Key Features

Unified platform-agnostic interface for mobile devices

Supports both iOS and Android (simulators, emulators, real devices)

Structured accessibility tree and snapshot support

Screenshot-based coordinate action fallback

Enables LLM and agent-driven interaction

Visual analysis of device screens

Scripted flow and form automation

Deterministic tool application

No dependency on device-specific knowledge

Agent-to-agent communication for automation

Use Cases

Automating mobile app testing workflows

Scripted multi-step user journey execution for data-entry

Driving mobile UI automation with language models

Extracting structured data from app interfaces

DIY robotic process automation for mobile environments

Seamless agent-driven operation of iOS and Android apps

Cross-device interactions for QA testing

Coordinating interactions between multiple automation agents

Accessibility-centric app analysis and interaction

Remote control and automation of real or virtual mobile devices

README

Mobile Next - MCP server for Mobile Development and Automation | iOS, Android, Simulator, Emulator, and Real Devices

This is a Model Context Protocol (MCP) server that enables scalable mobile automation and development through a platform-agnostic interface, eliminating the need for separate iOS or Android expertise. You can run it on emulators, simulators, and real devices (iOS and Android). The server lets agents and LLMs interact with native iOS/Android applications and devices through structured accessibility snapshots or through coordinate-based taps derived from screenshots.

https://github.com/user-attachments/assets/c4e89c4f-cc71-4424-8184-bdbc8c638fa1

🚀 Mobile MCP Roadmap: Building the Future of Mobile

Join us on our journey as we continuously enhance Mobile MCP! Check out our detailed roadmap to see upcoming features, improvements, and milestones. Your feedback is invaluable in shaping the future of mobile automation.

👉 Explore the Roadmap

Main use cases

How we help to scale mobile automation:

📲 Native app automation (iOS and Android) for testing or data-entry scenarios.
📝 Scripted flows and form interactions without manually controlling simulators/emulators or real devices (iPhone, Samsung, Google Pixel, etc.).
🧭 Automating multi-step user journeys driven by an LLM.
👆 General-purpose mobile application interaction for agent-based frameworks.
🤖 Agent-to-agent communication for mobile automation use cases and data extraction.

Main Features

🚀 Fast and lightweight: Uses native accessibility trees for most interactions, or screenshot-based coordinates where a11y labels are not available.
🤖 LLM-friendly: No computer vision model is required when working from accessibility snapshots.
🧿 Visual Sense: Evaluates and analyses what’s actually rendered on screen to decide the next action. If accessibility data or view-hierarchy coordinates are unavailable, it falls back to screenshot-based analysis.
📊 Deterministic tool application: Reduces the ambiguity found in purely screenshot-based approaches by relying on structured data whenever possible.
📺 Extract structured data: Enables you to extract structured data from anything visible on screen.
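
The accessibility-first behaviour described above can be illustrated with a small decision function. This is only a sketch of the idea, not Mobile MCP's implementation: the `ScreenElement` shape, the `planTap` function, and the `"needs-screenshot"` result are hypothetical names chosen for this example.

```typescript
// Hypothetical element shape; the fields Mobile MCP actually returns may differ.
interface ScreenElement {
  label: string;
  rect: { x: number; y: number; width: number; height: number };
}

type TapPlan =
  | { kind: "tap"; x: number; y: number } // resolved from structured data
  | { kind: "needs-screenshot" };         // fall back to visual analysis

// Prefer a labelled accessibility element; otherwise signal a screenshot fallback.
function planTap(elements: ScreenElement[], wantedLabel: string): TapPlan {
  const target = wantedLabel.trim().toLowerCase();
  const match = elements.find((e) => e.label.trim().toLowerCase() === target);
  if (!match) {
    return { kind: "needs-screenshot" };
  }
  // Tap the centre of the element's bounding box.
  return {
    kind: "tap",
    x: Math.round(match.rect.x + match.rect.width / 2),
    y: Math.round(match.rect.y + match.rect.height / 2),
  };
}

// Made-up element list for illustration.
const sampleElements: ScreenElement[] = [
  { label: "Sign in", rect: { x: 40, y: 600, width: 300, height: 50 } },
];
const signInPlan = planTap(sampleElements, "sign in");
```

When a label matches, the plan carries coordinates computed from structured data, so the same input always yields the same tap. Only a missing label triggers the slower, more ambiguous screenshot path.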

🔧 Available MCP Tools

For detailed implementation and parameter specifications, see src/server.ts

Device Management

mobile_list_available_devices - List all available devices (simulators, emulators, and real devices)
mobile_get_screen_size - Get the screen size of the mobile device in pixels
mobile_get_orientation - Get the current screen orientation of the device
mobile_set_orientation - Change the screen orientation (portrait/landscape)
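
MCP clients invoke tools by sending a JSON-RPC 2.0 `tools/call` request. Most clients build this message for you, but a sketch of the envelope can help when debugging. The `buildToolCall` helper below is a hypothetical name, and the assumption that `mobile_list_available_devices` takes no arguments should be checked against src/server.ts.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

let nextRequestId = 1;

// Build a request for any tool; argument names are NOT validated here.
function buildToolCall(
  toolName: string,
  args: Record<string, unknown> = {},
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextRequestId++,
    method: "tools/call",
    params: { name: toolName, arguments: args },
  };
}

// Assumption: listing devices needs no arguments; see src/server.ts.
const listDevicesRequest = buildToolCall("mobile_list_available_devices");
const wireMessage = JSON.stringify(listDevicesRequest);
```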

App Management

mobile_list_apps - List all installed apps on the device
mobile_launch_app - Launch an app using its package name
mobile_terminate_app - Stop and terminate a running app
mobile_install_app - Install an app from file (.apk, .ipa, .app, .zip)
mobile_uninstall_app - Uninstall an app using bundle ID or package name

Screen Interaction

mobile_take_screenshot - Take a screenshot to understand what's on screen
mobile_save_screenshot - Save a screenshot to a file
mobile_list_elements_on_screen - List UI elements with their coordinates and properties
mobile_click_on_screen_at_coordinates - Click at specific x,y coordinates
mobile_double_tap_on_screen - Double-tap at specific coordinates
mobile_long_press_on_screen_at_coordinates - Long press at specific coordinates
mobile_swipe_on_screen - Swipe in any direction (up, down, left, right)

Input & Navigation

mobile_type_keys - Type text into focused elements with optional submit
mobile_press_button - Press device buttons (HOME, BACK, VOLUME_UP/DOWN, ENTER, etc.)
mobile_open_url - Open URLs in the device browser
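
A scripted journey is an ordered sequence of these tool calls. The sketch below models a flow as plain data and checks the tool names against the list above before anything is sent. The tool names come from this README; every argument key (`packageName`, `x`, `y`, `text`, `submit`, `button`) and every value is an assumption made for illustration, so consult src/server.ts for the real parameter names.

```typescript
interface ToolStep {
  tool: string;
  args: Record<string, unknown>;
}

// Tool names taken from the lists above; used to catch typos in a flow.
const KNOWN_TOOLS = new Set<string>([
  "mobile_launch_app",
  "mobile_list_elements_on_screen",
  "mobile_click_on_screen_at_coordinates",
  "mobile_type_keys",
  "mobile_press_button",
]);

// Return the tool names in `flow` that are not in KNOWN_TOOLS.
function findUnknownTools(flow: ToolStep[]): string[] {
  return flow.map((s) => s.tool).filter((t) => !KNOWN_TOOLS.has(t));
}

// Hypothetical login flow; argument keys and values are assumptions.
const loginFlow: ToolStep[] = [
  { tool: "mobile_launch_app", args: { packageName: "com.example.app" } },
  { tool: "mobile_list_elements_on_screen", args: {} },
  { tool: "mobile_click_on_screen_at_coordinates", args: { x: 190, y: 625 } },
  { tool: "mobile_type_keys", args: { text: "user@example.com", submit: true } },
  { tool: "mobile_press_button", args: { button: "BACK" } },
];

const unknownTools = findUnknownTools(loginFlow);
```

Keeping the flow as data makes it easy to review, replay, or hand to an agent as a plan.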

Platform Support

iOS: Simulators and real devices via native accessibility and WebDriverAgent
Android: Emulators and real devices via ADB and UI Automator
Cross-platform: Unified API works across both iOS and Android

🏗️ Mobile MCP Architecture

📚 Wiki page

See our wiki page for setup, configuration, and debugging questions.

Installation and configuration

The standard config works in most tools:

```json
{
  "mcpServers": {
    "mobile-mcp": {
      "command": "npx",
      "args": ["-y", "@mobilenext/mobile-mcp@latest"]
    }
  }
}
```
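
If your client already has other MCP servers configured, the entry has to be merged into the existing `mcpServers` object instead of replacing it. A minimal sketch of that merge follows. The `McpConfig` type, the `withMobileMcp` helper, and the `other-server` entry are hypothetical, and reading or writing the actual settings file is left out because its location depends on the client.

```typescript
interface McpServerEntry {
  command: string;
  args: string[];
}

interface McpConfig {
  mcpServers?: Record<string, McpServerEntry>;
}

// Entry taken from the standard config above.
const MOBILE_MCP_ENTRY: McpServerEntry = {
  command: "npx",
  args: ["-y", "@mobilenext/mobile-mcp@latest"],
};

// Return a copy of `existing` with the mobile-mcp server added or replaced.
function withMobileMcp(existing: McpConfig): McpConfig {
  return {
    ...existing,
    mcpServers: {
      ...(existing.mcpServers ?? {}),
      "mobile-mcp": MOBILE_MCP_ENTRY,
    },
  };
}

// Hypothetical settings with one other server already configured.
const existingSettings: McpConfig = {
  mcpServers: { "other-server": { command: "node", args: ["server.js"] } },
};
const mergedSettings = withMobileMcp(existingSettings);
```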

To set up Cline, add the JSON above to your MCP settings file.

To install in Cursor, go to Cursor Settings -> MCP -> Add new MCP Server. Name it to your liking and use the command type with the command npx -y @mobilenext/mobile-mcp@latest. You can also verify the config or add command-line arguments by clicking Edit.

Use the Gemini CLI to add the Mobile MCP server:

```bash
gemini mcp add mobile-mcp npx -y @mobilenext/mobile-mcp@latest
```

Go to Advanced settings -> Extensions -> Add custom extension. Name to your liking, use type STDIO, and set the command to npx -y @mobilenext/mobile-mcp@latest. Click "Add Extension".

Open Qodo Gen chat panel in VSCode or IntelliJ → Connect more tools → + Add new MCP → Paste the standard config above.

Click Save.

🛠️ How to Use 📝

After adding the MCP server to your IDE/client, you can instruct your AI assistant to use the available tools. For example, in Cursor's agent mode, you could use the prompts below to quickly validate, test, and iterate on UI interactions, read information from the screen, and go through complex workflows. Be descriptive and straight to the point.

✨ Example Prompts

Workflows

You can specify detailed workflows in a single prompt, verify business logic, and set up automations. You can go crazy:

Search for a video, comment, like and share it.

Find the video called "Beginner Recipe for Tonkotsu Ramen" by Way of
Ramen, click on like video, after liking write a comment "this was
delicious, will make it next Friday", share the video with the first
contact in your WhatsApp list.

Download a highly rated Pomodoro app, register, start a timer and 5-star the app

Find and download a free "Pomodoro" app that has more than 1k stars.
Launch the app, register with my email, after registration find how to
start a Pomodoro timer. Once the Pomodoro timer has started, go back to the
app store and rate the app 5 stars, and leave a comment on how useful the
app is.

Search in Substack, read, highlight, comment and save an article

Open the Substack website, search for "Latest trends in AI automation 2025",
open the first article, highlight the section titled "Emerging AI trends",
save the article to the reading list for later review, and comment with a
summary of a random paragraph.

Reserve a workout class, set a timer

Open ClassPass, search for yoga classes tomorrow morning within 2 miles,
book the highest-rated class at 7 AM, confirm the reservation, and
set up a timer on the phone for the booked slot.

Find a local event, set up a calendar event

Open Eventbrite, search for AI startup meetup events happening this
weekend in "Austin, TX", select the most popular one, register and RSVP
yes to the event, set up a calendar event as a reminder.

Check the weather forecast and send a WhatsApp/Telegram/Slack message

Open the Weather app, check tomorrow's weather forecast for "Berlin", and
send the summary via WhatsApp/Telegram/Slack to contact "Lauren Trown",
thumbs up their response.

Schedule a meeting in Zoom and share invite via email

Open Zoom app, schedule a meeting titled "AI Hackathon" for tomorrow at
10AM with a duration of 1 hour, copy the invitation link, and send it via
Gmail to contacts "team@example.com".

More prompt examples can be found here.

Prerequisites

What you will need to connect MCP with your agent and mobile devices:

Xcode command line tools
Android Platform Tools
Node.js v22+
MCP-supported foundation models or agents, such as Claude MCP, OpenAI Agent SDK, Copilot Studio
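
A setup script can verify the Node.js requirement before launching the server. The sketch below is one way to do that check. The `meetsNodeRequirement` helper is a hypothetical name, and it takes the version as a string so it stays testable; in Node.js you would typically pass `process.versions.node`.

```typescript
// Return true when a version string such as "22.3.0" or "v22.3.0"
// has a major version of at least `minMajor` (22, per the list above).
function meetsNodeRequirement(version: string, minMajor: number = 22): boolean {
  const match = /^v?(\d+)(\.|$)/.exec(version.trim());
  if (!match) {
    return false; // not a recognisable version string
  }
  return Number(match[1]) >= minMajor;
}

// Example inputs; in Node.js, pass process.versions.node instead.
const okOnNode22 = meetsNodeRequirement("22.3.0");
const okOnNode20 = meetsNodeRequirement("v20.11.1");
```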

Simulators, Emulators, and Real Devices

When launched, Mobile MCP can connect to:

iOS Simulators on macOS
Android Emulators on Linux/Windows/macOS
iOS or Android real devices (requires proper platform tools and drivers)

Make sure you have your mobile platform SDKs (Xcode, Android SDK) installed and configured properly before running Mobile Next Mobile MCP.

Running in "headless" mode on Simulators/Emulators

When you do not have a real device connected to your machine, you can run Mobile MCP with an emulator or simulator in the background.

For example, on Android:

Start an emulator (avdmanager / emulator command).
Run Mobile MCP with the desired flags.

On iOS, you'll need Xcode, and the Simulator must be booted before you use Mobile MCP with that simulator instance.

xcrun simctl list
xcrun simctl boot "iPhone 16"
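
To confirm from a script that a simulator is actually booted, you can parse the JSON form of the device list (`xcrun simctl list devices --json`). The sketch below assumes the usual shape of that output, a `devices` object keyed by runtime identifier. The `bootedSimulators` helper and the sample data are made up for illustration, and running the command itself is left to the caller.

```typescript
interface SimDevice {
  name: string;
  udid: string;
  state: string; // e.g. "Booted" or "Shutdown"
}

interface SimctlList {
  devices: Record<string, SimDevice[]>; // keyed by runtime identifier
}

// Return every simulator whose state is "Booted", across all runtimes.
function bootedSimulators(simctlJson: string): SimDevice[] {
  const parsed = JSON.parse(simctlJson) as SimctlList;
  const booted: SimDevice[] = [];
  for (const runtime of Object.keys(parsed.devices)) {
    for (const device of parsed.devices[runtime] ?? []) {
      if (device.state === "Booted") {
        booted.push(device);
      }
    }
  }
  return booted;
}

// Made-up sample mimicking `xcrun simctl list devices --json`.
const sampleSimctlOutput = JSON.stringify({
  devices: {
    "com.apple.CoreSimulator.SimRuntime.iOS-18-0": [
      { name: "iPhone 16", udid: "AAAA-1111", state: "Booted" },
      { name: "iPhone 16 Pro", udid: "BBBB-2222", state: "Shutdown" },
    ],
  },
});
const bootedNow = bootedSimulators(sampleSimctlOutput);
```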

Thanks to all contributors ❤️

We appreciate everyone who has helped improve this project.

Repository Details

Owner: mobile-next (organization)
Language: TypeScript (94.86%), JavaScript (5.14%)
Default branch: main
Size: 2,252 KB
Contributors: 18
License: Apache License 2.0
MCP Verified: Nov 12, 2025
Topics: agent, android, emulator, ios, mcp, mobile, physical, real, simulator
