Trustwave and Cybereason Merge to Form Global MDR Powerhouse for Unparalleled Cybersecurity Value. Learn More

Trustwave and Cybereason Merge to Form Global MDR Powerhouse for Unparalleled Cybersecurity Value. Learn More

Services
Managed Detection & Response

Eliminate active threats with 24/7 threat detection, investigation, and response.

Co-Managed SOC (SIEM)

Maximize your SIEM investment, stop alert fatigue, and enhance your team with hybrid security operations support.

Advisory & Diagnostics

Advance your cybersecurity program and get expert guidance where you need it most.

Penetration Testing

Test your physical locations and IT infrastructure to shore up weaknesses before exploitation.

Database Security

Prevent unauthorized access and exceed compliance requirements.

Email Security

Stop email threats others miss and secure your organization against the #1 ransomware attack vector.

Digital Forensics & Incident Response

Prepare for the inevitable with 24/7 global breach response in-region and available on-site.

Firewall & Technology Management

Mitigate risk of a cyberattack with 24/7 incident and health monitoring and the latest threat intelligence.

Solutions
BY TOPIC
Microsoft Security
Unlock the full power of Microsoft Security
Offensive Security
Solutions to maximize your security ROI
Rapidly Secure New Environments
Security for rapid response situations
Securing the Cloud
Safely navigate and stay protected
Securing the IoT Landscape
Test, monitor and secure network objects
Why Trustwave
About Us
Awards and Accolades
Trustwave SpiderLabs Team
Trustwave Fusion Security Operations Platform
Trustwave Security Colony
Partners
Technology Alliance Partners
Key alliances who align and support our ecosystem of security offerings
Trustwave PartnerOne Program
Join forces with Trustwave to protect against the most advance cybersecurity threats
Loading...
Loading...

HOWTO: Enabling the OCR feature in MailMarshal

Expand / Collapse


This article applies to:

  • Trustwave MailMarshal SEG (8.2.X and above)
  • Optical Character Recognition in image files

Question:

  • How do I enable the OCR feature in MailMarshal?

Background:

As of July 7, 2021, Trustwave provides an OCR feature in MailMarshal 8.2.X and above. This feature attempts to extract text from image files. The extracted text is scanned by MailMarshal layers such as the spam and malware detection and user supplied TextCensor scripts.

Due to performance considerations this feature is not enabled by default.

Prerequisites:

This feature requires a processor with AVX (Advanced Vector Extensions) support. Recommended processors (i3 and above) do support AVX.

Attempting to use this feature on a system without AVX support will result in services failing.

To check your system, you can use the free software HWiNFO (installer or portable version). Check the features panel or report for AVX support.

Procedure: 

To enable the feature:

  1. On the Array Manager server, navigate to the Config subfolder in the SEG installation.
  2. Ensure that the files tensorflow.dll  and tessdata.zip are present. These file are provided through automatic updates from a CDN location.
  3. Make a copy of the file unpacker.config.json
    • Name the copy override.unpacker.config.json 
  4. Edit override.unpacker.config.json and locate the line "MaxImageCount": 0,
    • Change the line to read   "MaxImageCount": 50,
    • Be sure to preserve the comma, quotes, and case of the text. Invalid formatting can cause the SEG services to stop.
  5. Save the file and commit configuration. Restart the Engine service.
  6. Monitor the Content Analysis log or Engine log for text files named ImageExtracted.txt
    • In debug logging additional information is logged, such as:
      • Unpacking image <filename> consumed nnnn milliseconds
      • Image OCR: timeout limit exceeded
  7. If image extraction is often timing out, you can change the "Timeout" setting in override.unpacker.config.json (values in milliseconds). However, you should use caution to avoid excessive delay and resource usage.

Notes:

The json file also contains other advanced settings. Trustwave recommends against making other changes.


To contact Trustwave about this article or to request support:


Rate this Article:
     

Add Your Comments


Comment submission is disabled for anonymous users.
Please send feedback to Trustwave Technical Support or the Webmaster
.