MCP servers are getting more popular. However, they lack the ability to interact with what is shown on screen.
I suggest creating a dedicated API that describes every action available on the current screen and how to invoke it.
For example, suppose I am watching a YouTube video. The current activity shows the content and declares what it can do to an LLM, in a format like:
[
  {
    "description": "rewind video by an integer number of seconds, where positive means forward and negative means backward",
    "callback": callback
  },
  {
    "description": "stop video",
    "callback": callback
  },
  {
    "description": "find video by search term",
    "callback": callback
  }
]
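To make the idea concrete, here is a minimal sketch of what the activity side could look like. Everything below (`ScreenAction`, `ScreenActionRegistry`, the stub `player`) is hypothetical and invented for illustration, not part of any existing MCP SDK:

```typescript
// Hypothetical sketch, not an existing MCP API: a registry of screen actions
// that the current activity exposes to an LLM.

interface ScreenAction {
  // Natural-language description the LLM uses to pick an action.
  description: string;
  // Invoked with arguments the LLM derives from the user's request.
  callback: (args: Record<string, unknown>) => Promise<string>;
}

class ScreenActionRegistry {
  private actions = new Map<string, ScreenAction>();

  register(name: string, action: ScreenAction): void {
    this.actions.set(name, action);
  }

  // Serializable listing the activity hands to the LLM.
  describe(): { name: string; description: string }[] {
    return [...this.actions.entries()].map(([name, a]) => ({
      name,
      description: a.description,
    }));
  }

  async invoke(name: string, args: Record<string, unknown>): Promise<string> {
    const action = this.actions.get(name);
    if (!action) throw new Error(`Unknown action: ${name}`);
    return action.callback(args);
  }
}

// Stub video player standing in for the real activity.
const player = {
  position: 0,
  playing: true,
  seekBy(seconds: number) { this.position += seconds; },
  stop() { this.playing = false; },
};

const registry = new ScreenActionRegistry();
registry.register("rewind", {
  description:
    "rewind video by an integer number of seconds, where positive means forward and negative means backward",
  callback: async (args) => {
    player.seekBy(Number(args.seconds));
    return `moved playback by ${args.seconds} seconds`;
  },
});
registry.register("stop", {
  description: "stop video",
  callback: async () => {
    player.stop();
    return "video stopped";
  },
});
```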
Using this interface, anyone could build an app that works hands-free, whether through voice commands or an autonomous assistant.
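On the consumer side, a voice assistant could forward the transcribed command together with the declared action list to an LLM and invoke whichever callback it selects. A sketch under the same assumptions, reusing `registry` from above; `transcribe` and `chooseAction` are placeholder signatures, not real library functions:

```typescript
// Placeholder signatures; real implementations would call a speech-to-text
// service and an LLM. Both names are assumptions made for this sketch.
declare function transcribe(audio: ArrayBuffer): Promise<string>;
declare function chooseAction(
  utterance: string,
  actions: { name: string; description: string }[],
): Promise<{ name: string; args: Record<string, unknown> }>;

// Assistant loop: hand the user's words plus the declared actions to an LLM,
// then invoke whatever it picks.
async function handleVoiceCommand(audio: ArrayBuffer): Promise<void> {
  const utterance = await transcribe(audio); // e.g. "go back ten seconds"
  const choice = await chooseAction(utterance, registry.describe());
  const result = await registry.invoke(choice.name, choice.args);
  console.log(result);
}
```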