Get text from inside google chrome using my c# app

Question 1

I am writing a small app that will among other things expand shortcuts into full text while typing. example: the user writes "BNN" somewhere and presses the relevant keyboard combination, the app would replace the "BNN" with a "Hi I am Banana".

after some research i learned that it can be done using user32.dll and the process of achieving this task is as follows:

1) get the active window handle
2) get the active window thread handle
3) attach input to active thread
4) get focused control handle (+caret position but that is not the issue)
5) detach input from active thread
6) get the text from the focused control using its handle

and here is my code so far:

try
{
 IntPtr activeWindowHandle = GetForegroundWindow();
 IntPtr activeWindowThread = GetWindowThreadProcessId(activeWindowHandle, IntPtr.Zero);
 IntPtr thisWindowThread = GetWindowThreadProcessId(this.Handle, IntPtr.Zero);
 AttachThreadInput(activeWindowThread, thisWindowThread, true);
 IntPtr focusedControlHandle = GetFocus();
 AttachThreadInput(activeWindowThread, thisWindowThread, false);
 if (focusedControlHandle != IntPtr.Zero)
 {
 TB_Output.Text += focusedControlHandle + " , " + GetText(focusedControlHandle) + Environment.NewLine;
 }
}
catch (Exception exp)
{
 MessageBox.Show(exp.Message);
}
//...
//...
[DllImport("user32.dll", CharSet = CharSet.Auto, ExactSpelling = true)]
internal static extern IntPtr GetForegroundWindow();
[DllImport("user32.dll", CharSet = CharSet.Auto, SetLastError = true)]
internal static extern int GetWindowThreadProcessId(int handle, out int processId);
[DllImport("user32", CharSet = CharSet.Ansi, SetLastError = true, ExactSpelling = true)]
internal static extern int AttachThreadInput(IntPtr idAttach, IntPtr idAttachTo, bool fAttach);
[DllImport("user32.dll", CharSet = CharSet.Auto, ExactSpelling = true)]
internal static extern IntPtr GetFocus();

this works perfectly for some windows forms apps but it doesnt work with WPF nor browsers, just gives me the title of the WPF app or the title of the tab in chrome.

if i run the app on this page while typing this question for instance, instead of the content of the question, the text i get is:

Get text from inside google chrome using my c# app - Stack Overflow - Google

probably because they use graphics to render the elements, and im not sure how i can get to the active element and read it's text.

i only referred to web browsers in the question's title because this tool will be mostly used with web browsers.

thank you in advance for any feedback.

Question 2

Not sure if it is the best approach, I would go developer.chrome.com/extensions/devguide It is doable imho, but hooking into the web browser could trigger AV software like hell.

Question 3

@bradbury9 i considered making an extension but it causes too many problems, the main one being that this tool will be used mostly with chrome but not only, so i cant restrict it to a chrome extension. or any other browser extension actually. +its easier to maintain and update as an app if i install it to my whole company...

Question 4

@bradbury9 arranging an exception in our overly protective anti virus is not a problem.

Question 5

If you want to do that in web browsers and WPF apps, you will have to create a keylogger that constantly monitors the keyboard and replaces the text simulating the keyboard input. WPF controls have no Windows handles, so WinAPI is useless for them. Same for the controls rendered in the web browsers.

Question 6

@dymanoid thanks for the input, technically my app already is a keylogger as it monitors for the combination of keys that triggers the expanding. I am aware unfortunately that browsers and WTF window controsl have no handles (since they are technically graphical objects), but maybe there is a creative way of achieving this? spell checkers do manage to do it somehow, why cant we?

Question 7

I would personally attempt to create a library which chrome prefers. There are many available such as Kantu, which is specialized for Chrome.

Examples: TestCafe, Watir, SlimerJS

Question 8

I think that library is not the optimal way to do what you want. I would use a library more suited to browser DOM manipulation, like Selenium.

foyss 9932 gold badges8 silver badges24 bronze badges · Accepted Answer · 2018-07-06 12:35:53Z

3

I would personally attempt to create a library which chrome prefers. There are many available such as Kantu, which is specialized for Chrome.

Examples: TestCafe, Watir, SlimerJS

Share

Improve this answer

edited Nov 3, 2018 at 18:50

Alex Skorkin's user avatar

Alex Skorkin

4,2743 gold badges27 silver badges48 bronze badges

answered Jul 6, 2018 at 12:35

foyss's user avatar

foyss

9932 gold badges8 silver badges24 bronze badges

Sign up to request clarification or add additional context in comments.

CollectivesTM on Stack Overflow

Get text from inside google chrome using my c# app

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related