You need to use iText 7. Please check below code snippet-
// iText imports
import com.itextpdf.kernel.pdf.PdfDocument;
import com.itextpdf.kernel.pdf.PdfReader;
import com.itextpdf.kernel.pdf.canvas.parser.PdfTextExtractor;
public class HindiMarathiText {
@Test
public void go() throws Exception {
try (PdfDocument doc = new PdfDocument(new PdfReader("input.pdf"))) {
try (OutputStream os = new FileOutputStream("output.txt")) {
String result = PdfTextExtractor.getTextFromPage(doc.getPage(3));
os.write(result.getBytes(Charset.forName("UTF-16")));
}
}
}
}
Note:
-----------------------------
You need to build iText 7 from source (https://github.com/itext/itext7) to achieve the above quality. This functionality is available in iText 7.0.2 release. You can have a look here:
iText 7.0.2 release Note- Click Here
Some useful references:
--------------------------------------------
iText7 pdfcalligraph- Click Here
iText Language Specific Examples- Click Here