tracker issue : CF-4203233

Title:

cfpdf action="extract" considerably slower

Status/Resolution/Reason: To Fix//HaveNewInfo

Reporter/Name(from Bugbase): Matthew Clemente / ()

Created: 08/05/2018

Components: Performance

Versions: 13.0

Failure Type: Performance Issue

Found In Build/Fixed In Build: 2018,0,0,310739 /

Priority/Frequency: Major / All users will encounter

Locale/System: / Other

Vote Count: 0

Problem Description: Extracting text with cfpdf in ColdFusion 2018 is considerably slower than in earlier versions: it takes more than twice as long.

Steps to Reproduce: I downloaded and used this PDF for testing: https://www.planetebook.com/free-ebooks/1984.pdf

I ran the following with versions 10, 11, 2016, and 2018.
___
start = getTickCount();
cfpdf( action = 'extracttext', source = expandPath( 'path-to-pdf' ), name = 'text', useStructure = true, honourspaces = true );
duration_in_milliseconds = getTickCount() - start;
writeOutput('#duration_in_milliseconds / 1000#');
___

Actual Result: ColdFusion 2018 was more than twice as slow as any other version.

Expected Result: Extraction time in the latest version should be about the same as in earlier versions.

Any Workarounds: Using a Java Library like PDFBox for text extraction.
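
A minimal sketch of that workaround, for illustration only. It assumes a PDFBox 2.x JAR is available on the ColdFusion classpath (PDFBox 3.x loads documents through Loader.loadPDF instead of PDDocument.load), and it reuses the 'path-to-pdf' placeholder from the steps above. The variable names are hypothetical, and this version was not part of the timings reported here.
___
// Assumption: PDFBox 2.x classes can be loaded by createObject().
pdfFile = createObject( 'java', 'java.io.File' ).init( expandPath( 'path-to-pdf' ) );
// PDDocument.load() is a static method, so no init() call is needed on the class.
pdDocument = createObject( 'java', 'org.apache.pdfbox.pdmodel.PDDocument' ).load( pdfFile );
try {
    stripper = createObject( 'java', 'org.apache.pdfbox.text.PDFTextStripper' ).init();
    // getText() returns the text of the whole document as one string.
    text = stripper.getText( pdDocument );
} finally {
    // Always release the underlying file handle, even if extraction fails.
    pdDocument.close();
}
writeOutput( len( text ) );
___
Wrapping this block in the same getTickCount() calls used above would allow a like-for-like timing comparison against cfpdf.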

Attachments:

Comments: