Automating Chrome Browser from C#

Chrome Browser supports less known API called Remote Debugging. It is a JSON/WebSockets based protocol, which is basically used by Chrome Developer Tools to attach to a remote web page.

Note:

You can read more about this protocol at official page http://blog.chromium.org/2011/05/remote-debugging-with-chrome-developer.html

Setting up environment

To enable Remote Debugging, you need to start chrome with a parameter, enabling Chrome to listen on a specified Tcp port:

chrome.exe --remote-debugging-port=9222

Also, please ensure, that you dont have firewall blocking traffic to that port.

Once, you have Remote debugging enabled in Chrome, to check whether everything is working fine, we might want to start another Chrome session. This time, it has to be a session for a different user. To start chrome under different user, please start another chrome with these parameters:

chrome.exe --user-data-dir=C:\myChromeUser

You might want to change the directory to some other temp directory of client user.

Once you start client Chrome for the different user, you just go to the url:

http://localhost:9222

Which will basically open a webpage, offering you to select which tab in Chrome you want to debug remotely.

The page might look like:

Note: You should been opening this page with Chrome, because other browsers does not support Remote Debugging.

Selecting a Page, you should have Debugging Tools open for the page. It looks similar to the local Debugging Tools:

Then, just to try simple command, we might want to test alert being invoked in the debugged browser, simply switch to console and like on a local environment use:

alert('Hello World');

That was the simple test.

Lets return to automation. Chrome Remote Debugging page describes the protocol as JSON based, with some examples. I wanted to catch some traffic flowing from Remote Debugging Chrome page going to debugged Chrome. To that, I have used Fiddler2 Http Proxy, once I set that to my system proxy, I was able to catch the initial protocol flow. The rest of the protocol flow goes over WebSockets.

WebSockets in place

When you look at the result in fiddler, (or simply use your browser and go to http://localhost:9222/json. Note: When you dont see WebSocketDebuggerUrl in response, you are probably connected to RemoteDebugging endpoint with a different client) you will see something like:

[ {
   "devtoolsFrontendUrl": "/devtools/devtools.html?host=localhost:9222&page=1",
   "faviconUrl": "http://s2.wp.com/i/favicon.ico?m=1311976027g",
   "thumbnailUrl": "/thumb/https://markcz.wordpress.com/",
   "title": "Martin Kunc's blog",
   "url": "https://markcz.wordpress.com/",
   "webSocketDebuggerUrl": "ws://localhost:9222/devtools/page/1"
} ]

As you can see, this is pure Json. Anyway, what we are interested in the most is

   "webSocketDebuggerUrl": "ws://localhost:9222/devtools/page/1"

This element points to WebSockets endpoint, where Chrome is willing to listen for Json formatted requests.

The problem here is that currently Fiddler does not have nice support for WebSockets. Even if they goes over same ports, they are socket based. Fiddler beta has preliminary support for it in its Log tab, but its not so fancy formatted.

WebSockets in .NET

I had troubles with looking for a nice WebSockets Client implementation and then finally I found WebSocket4Net, which is WebSocket client implementation primarly used for SuperWebSocket server, but works pretty nice here.

Preparing Json commands for Remote Debugging protocol

Using the fiddler and its Log tab I was able to see enough samples of Json commands, which Remote DP is using. It is also described here. I have used v 0.1 of it, because I currently have Chrome 17.

Finally driving Chrome

Here is my snippet how I am driving Chrome in C#:

var chrome = new Chrome("http://localhost:9222");
var sessions = chrome.GetAvailableSessions();
Console.WriteLine("Available debugging sessions");
foreach (var s in sessions)
  Console.WriteLine(s.url);

if (sessions.Count == 0)
  throw new Exception("All debugging sessions are taken.");

// Will drive first tab session
var sessionWSEndpoint = sessions[0].webSocketDebuggerUrl;
chrome.SetActiveSession(sessionWSEndpoint);
chrome.NavigateTo("http://www.google.com");

var result = chrome.Eval("document.getElementById('lst-ib').value='Hello World'");
result = chrome.Eval("document.forms[0].submit()");

Console.ReadLine();

Thats basically all, it is just a proof of concept application.

Sources are available on the GitHub link: https://github.com/markcz/automate-chrome

21 thoughts on “Automating Chrome Browser from C#”

Would be nice to have a VS 2010 version.

Hi ,
This is what I wanted to work upon.

Could you please share the code on git-hub since I am not able to access the link ?
Thanks!

Martin Kunc says:

September 11, 2013 at 12:48

I will, thanks for the notice !

Reply
Martin Kunc says:

September 12, 2013 at 09:27

HI, the git-hub link is now attached in the bottom of article.

Reply

WP turned the two characters “–” into a single “–” – threw me for a loop initially when trying to execute “chrome.exe –remote-debugging-port=9222”

Michael says:

July 16, 2014 at 18:29

Well, it did it again: should be “- -” without the spaces, NOT “–”

Reply
1. Martin Kunc says:
  
  July 17, 2014 at 00:16
  
  Thank you Michael, I have updated the command line fragments, code listings and the links more readable form. I hope now it is more readable.

I’m trying it in java..Do you have same solution using java?

Martin Kunc says:

September 11, 2014 at 08:15

Hi Dhashi, I havent tried using Java for this, but there is a Google Java project here https://code.google.com/p/chromedevtools/wiki/ChromeDevToolsSdk which should handle remote connections to Chrome. The project is Called WIP. Hope this helps.

Reply
1. dhashi says:
  
  September 11, 2014 at 08:22
  
  Hi Martin, yeah i saw that but it mentioned as “The project is no longer actively maintained” in the project home, so i didn’t consider that.
2. Martin Kunc says:
  
  September 11, 2014 at 20:31
  
  Ok. Actually, If I would go for browser automation today, I would choose Selenium and WebDriver protocol. This post was written at times, when these were still at starts.
  You can use Selenium framework to abstract you from browser differences and automate FF/IE/Chrome (even at once). See http://www.youtube.com/watch?v=WSTSypF_3XI for demo, http://docs.seleniumhq.org/docs/appendix_installing_java_driver_client.jsp for configuration and for example this page for another sample: https://code.google.com/p/selenium/wiki/GettingStarted.
  Actually, Selenium WebDriver protocol has gone a long way since the beginning it become a standard and its now being natively supported in new IE.

Thanks martin, i’ll check.

monitoramento@ronsoft.com.br says:

January 30, 2016 at 00:15

Hey Martin..
In your sample, how could we save a image from disk ? Is it possible using Jason ?

I’m using your project and it`s working perfectly..

Thanks

Reply
1. Martin Kunc says:
  
  January 30, 2016 at 09:55
  
  Hi dhashi,
  It should be possible with latest Chrome Remote debugging protocol using:
  Page.captureScreenshot
  
  See http://stackoverflow.com/a/27376634 for a snippet or Official documentation: https://chromedevtools.github.io/debugger-protocol-viewer/Page/
  Martin

Pingback: Automate Safari web browser using c# on Windows | XL-UAT

Hi Martin.
In your example, what i ‘d have to do to retrive fields values ?

For example, if i want to return the value of document.getElementById(‘lst-ib’) to my C# application, what could i do ?

Thanks

Pingback: Connecting/Attaching application to browser? - HTML CODE

Pingback: Automate Safari web browser using c# on Windows [ANSWERED] - Tech ABC to XYZ

Hi
I’m using this soluation to make automation chrome.
But i have little tiny error on dispatchTouchEvent.
I tried sending touch event like this json below

{“method”:”Input.dispatchTouchEvent”,”params”:{“type”:”touchStart”,”touchPoints”:[{“x”: 219, “y”:84}]},”id”:1}

The result looks fine…
{“id”:1,”result”:{}}

After that i send TouchEnd json
{“method”:”Input.dispatchTouchEvent”,”params”:{“type”:”touchEnd”,”touchPoints”:[]},”id”:1}

But result is not ok
{“error”:{“code”:-32602,”message”:”Must send a TouchStart first to start a new touch.”},”id”:1}

I put delay between these code, But doesn’t work,

Anybody knows why this error occur??? I spend this almost three days to solve this. ;((((((

Pingback: Automate Safari web browser using c# on Windows

Martin Kunc says:

July 18, 2023 at 03:50

Hi, Thanks Steve

Reply

Gyula Andor says:

September 8, 2012 at 08:26

Would be nice to have a VS 2010 version.

renderinglife says:

September 10, 2013 at 11:48

Hi ,
This is what I wanted to work upon.

Could you please share the code on git-hub since I am not able to access the link ?
Thanks!

1. Martin Kunc says:
  
  September 11, 2013 at 12:48
  
  I will, thanks for the notice !
  
2. Martin Kunc says:
  
  September 12, 2013 at 09:27
  
  HI, the git-hub link is now attached in the bottom of article.
  
Michael says:

July 16, 2014 at 18:28

WP turned the two characters “–” into a single “–” – threw me for a loop initially when trying to execute “chrome.exe –remote-debugging-port=9222”

1. Michael says:
  
  July 16, 2014 at 18:29
  
  Well, it did it again: should be “- -” without the spaces, NOT “–”
  
  1. Martin Kunc says:
    
    July 17, 2014 at 00:16
    
    Thank you Michael, I have updated the command line fragments, code listings and the links more readable form. I hope now it is more readable.
dhashi says:

September 9, 2014 at 14:39

I’m trying it in java..Do you have same solution using java?

1. Martin Kunc says:
  
  September 11, 2014 at 08:15
  
  Hi Dhashi, I havent tried using Java for this, but there is a Google Java project here https://code.google.com/p/chromedevtools/wiki/ChromeDevToolsSdk which should handle remote connections to Chrome. The project is Called WIP. Hope this helps.
  
  1. dhashi says:
    
    September 11, 2014 at 08:22
    
    Hi Martin, yeah i saw that but it mentioned as “The project is no longer actively maintained” in the project home, so i didn’t consider that.
  2. Martin Kunc says:
    
    September 11, 2014 at 20:31
    
    Ok. Actually, If I would go for browser automation today, I would choose Selenium and WebDriver protocol. This post was written at times, when these were still at starts.
    You can use Selenium framework to abstract you from browser differences and automate FF/IE/Chrome (even at once). See http://www.youtube.com/watch?v=WSTSypF_3XI for demo, http://docs.seleniumhq.org/docs/appendix_installing_java_driver_client.jsp for configuration and for example this page for another sample: https://code.google.com/p/selenium/wiki/GettingStarted.
    Actually, Selenium WebDriver protocol has gone a long way since the beginning it become a standard and its now being natively supported in new IE.
dhashi says:

September 12, 2014 at 05:56

Thanks martin, i’ll check.

1. monitoramento@ronsoft.com.br says:
  
  January 30, 2016 at 00:15
  
  Hey Martin..
  In your sample, how could we save a image from disk ? Is it possible using Jason ?
  
  I’m using your project and it`s working perfectly..
  
  Thanks
  
  1. Martin Kunc says:
    
    January 30, 2016 at 09:55
    
    Hi dhashi,
    It should be possible with latest Chrome Remote debugging protocol using:
    Page.captureScreenshot
    
    See http://stackoverflow.com/a/27376634 for a snippet or Official documentation: https://chromedevtools.github.io/debugger-protocol-viewer/Page/
    Martin
Pingback: Automate Safari web browser using c# on Windows | XL-UAT
monitoramento@ronsoft.com.br says:

July 24, 2015 at 22:38

Hi Martin.
In your example, what i ‘d have to do to retrive fields values ?

For example, if i want to return the value of document.getElementById(‘lst-ib’) to my C# application, what could i do ?

Thanks

Pingback: Connecting/Attaching application to browser? - HTML CODE
Pingback: Automate Safari web browser using c# on Windows [ANSWERED] - Tech ABC to XYZ
dongjuparkcom says:

July 18, 2018 at 08:07

Hi
I’m using this soluation to make automation chrome.
But i have little tiny error on dispatchTouchEvent.
I tried sending touch event like this json below

{“method”:”Input.dispatchTouchEvent”,”params”:{“type”:”touchStart”,”touchPoints”:[{“x”: 219, “y”:84}]},”id”:1}

The result looks fine…
{“id”:1,”result”:{}}

After that i send TouchEnd json
{“method”:”Input.dispatchTouchEvent”,”params”:{“type”:”touchEnd”,”touchPoints”:[]},”id”:1}

But result is not ok
{“error”:{“code”:-32602,”message”:”Must send a TouchStart first to start a new touch.”},”id”:1}

I put delay between these code, But doesn’t work,

Anybody knows why this error occur??? I spend this almost three days to solve this. ;((((((

Pingback: Automate Safari web browser using c# on Windows
1. Martin Kunc says:
  
  July 18, 2023 at 03:50
  
  Hi, Thanks Steve

Automating Chrome Browser from C#

Setting up environment

WebSockets in place

WebSockets in .NET

Preparing Json commands for Remote Debugging protocol

Finally driving Chrome

Published by Martin Kunc

21 thoughts on “Automating Chrome Browser from C#”

Leave a reply to Martin Kunc Cancel reply

Setting up environment

WebSockets in place

WebSockets in .NET

Preparing Json commands for Remote Debugging protocol

Finally driving Chrome

Share this:

Related

Published by Martin Kunc

21 thoughts on “Automating Chrome Browser from C#”

Leave a reply to Martin Kunc Cancel reply