getPageNumWords in Acrobat SDK is not detecting words with '-' and '_'
we using acrobat sdk identifying , linking words on our pdf files. using getpagenumwords
method number of words , using quad , rect word link it.
(int page = 0; page < numpages; page++) { //for each page // number of words object objnumwords = comutils.invokemethod(jso, "getpagenumwords", page); if (objnumwords == null) throw new pdfprocessingexception("acrobat api error. cannot access doc.getpagenumwords()"); int numwords = convertutils.getint(objnumwords); //other logic goes here }
when there word abcd-efgh
or abcd_efgh
in pdf file. above method returns them abcd
, efgh
instead of 1 word.
is bug in acrobat sdk or not using intended?
btw using acrobat sdk 1.1
what missing.
thanks,
tippu
this expected result when using method (maybe not expected, it's how works).
More discussions in Acrobat SDK
adobe
Comments
Post a Comment