getPageNumWords in Acrobat SDK is not detecting words with '-' and '_'


We are using the Acrobat SDK to identify and link words in our PDF files. We use the getPageNumWords method to get the number of words on a page, then use each word's quads and rect to link it.

    for (int page = 0; page < numPages; page++)
    {
        // For each page, get the number of words.
        object objNumWords = ComUtils.InvokeMethod(jso, "getPageNumWords", page);
        if (objNumWords == null)
            throw new PdfProcessingException("Acrobat API error. Cannot access Doc.getPageNumWords()");
        int numWords = ConvertUtils.GetInt(objNumWords);

        // Other logic goes here.
    }

When there is a word such as abcd-efgh or abcd_efgh in the PDF file, the above method counts it as two words, abcd and efgh, instead of one word.

Is this a bug in the Acrobat SDK, or am I not using it as intended?

BTW, I am using Acrobat SDK 1.1.

What am I missing?

 

Thanks,

Tippu

This is the expected result when using this method (maybe not what you expected, but it's how it works).
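If the split cannot be turned off, one possible workaround is to rejoin the fragments yourself. The sketch below is only an illustration, not an Adobe-provided routine. It relies on Doc.getPageNthWord(nPage, nWord, bStrip) called with bStrip set to false, which keeps trailing punctuation and white space on each word. It assumes (please verify on your own files) that the '-' or '_' then stays attached to the end of the first fragment, so a fragment ending directly in one of those characters can be glued to the next one. The names joinSplitWords and joiners are hypothetical.

```javascript
// Sketch: rebuild words that were split at '-' or '_'.
// `doc` is any object exposing getPageNumWords(nPage) and
// getPageNthWord(nPage, nWord, bStrip), like the Acrobat JavaScript Doc object.
// ASSUMPTION: with bStrip = false the separator is returned at the end of
// the first fragment (for example "abcd-" followed by "efgh ").
function joinSplitWords(doc, page, joiners) {
  joiners = joiners || "-_";
  var total = doc.getPageNumWords(page);
  var words = [];
  var open = null; // word currently being assembled

  for (var i = 0; i < total; i++) {
    // bStrip = false keeps trailing punctuation and white space.
    var raw = String(doc.getPageNthWord(page, i, false));

    if (open === null) {
      open = { text: "", first: i, last: i };
    }
    open.last = i;

    var lastChar = raw.charAt(raw.length - 1);
    var endsInJoiner = raw.length > 1 && joiners.indexOf(lastChar) !== -1;

    if (endsInJoiner && i + 1 < total) {
      // No white space after the joiner: keep it and wait for the next fragment.
      open.text += raw;
    } else {
      // Word is complete: drop trailing white space and common punctuation.
      open.text += raw.replace(/[\s.,;:!?"')\]]+$/, "");
      words.push(open);
      open = null;
    }
  }
  return words;
}
```

Each returned entry carries the joined text plus the first and last original word indexes, so the quads for a link could be collected by calling getPageNthWordQuads for every index in that range. From the C# side, the same calls can presumably be made through the JSObject in the same way getPageNumWords is invoked in the question.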


