Ignore accumulated Byte Order Markers (BOM)

**Aaron** · 12-Mar-2018, 01:53 PM

Hello,

Our Text Compare will show the Encoding in the status bar, but the BOM itself is not in the comparison text below. A rules-based scan would normally ignore this information. Is there also hex or binary characters inserted in the Text Compare's main text pane? If you enable the View menu -> Hex Details, what does it show for the information on the first line? Could we see a full screen screenshot? You can post here or email at [email protected] along with a link back to this forum thread.

Generally, you are correct, you'd define an unimportant grammar RegEx with \x which can define a hex code.
http://www.scootersoftware.com/suppo..._unimportantv3
But you would want to verify the Hex info you are trying to ignore is actually represented in the main text pane.

**lbeazley** · 12-Mar-2018, 03:19 PM

Thank you for your time (and the great tool)
I attached a screenshot and an archive of the example files used in the screenshot.
The files in the archive are of type .uni which are UTF-16LE Text Files
I should add:
File Example1.uni is an example of an accumulation of two additional pairs of BOC
File Example2.uni is an example of an accumulation of one additional pair of BOC.
A normal file would only have one pair

Attached Files

BC4.zip (364 Bytes, 220 views)

**Aaron** · 12-Mar-2018, 04:32 PM

Hello,

The quick answer is Little Endian requires inverting the \x{NNNN} sequence, so it'd be:
\x{FEFF}

The another method is you could select the literal blank character(s) by placing the cursor just left of the first /, then shift+arrow to select the invisible characters (which you can see Selecting in the View menu -> Hex Details below), Copy to clipboard, and then in the Session Settings, Importance tab, create a new Unimportant element and Paste this invisible, literal character in. Make sure it *isn't* Regular Expression, and click ok to ignore the literal (invisible) character from the clipboard.

Ignore accumulated Byte Order Markers (BOM)

Ignore accumulated Byte Order Markers (BOM)

Comment

Comment

Comment