Skip to content

danielrosehill/Browser-Use-Agent-GUI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Browser Use Agent GUI (Linux)

alt text

This repository implements a simple GUI intended to facilitate working with Browser Use in a Linux computer environment.

Browser Use is an incredible project for facilitating agentic control over a web browser.

This GUI simply provides a minimal Python wrapper over the app and has been validated to work on Open SUSE Tumbleweed Linux using KDE Plasma as the desktop environment.

How To Use

Firstly, configure an API credential by copying .env.example to .env and providing your OpenAI API key.

Next, enter your prompt as the task description.

alt text

Hit the "Go" button!

Be prepared to be amazed.

Recommendation: Start with a very modest task to get a feel for how the program works.

The project's website has some suggestions around use-cases.

Stopping The Agent

Using Browser Use entails what it says on the tin: giving an LLM not only programmatic access to your computer, but the ability to directly manipulate a GUI.

While its potential utility is vast, equally there is great potential for destruction! Close supervision is highly recommended, at least during the trial and error period.

To facilitate aborting the agent and for the amusement factor, a conspicuous stop button has been added, as well as a stop context menu in the system tray icon.

alt text

Minimal Implementation

About Browser Use

https://browser-use.com/

https://github.com/browser-use/browser-use

About

Linux GUI for initiating and monitoring Browser Use with an exit switch

Topics

Resources

Stars

Watchers

Forks

Languages