Search code examples
c#base64memorystreampdf-reader

Solved: C# MemoryStream converted to byte array, then to base64 string removes RTL characters


I am adding page numbers to a pdf file,

It works correctly with english, but when I try to add hebrew text it ommits those letters.

I assume the problem is with the encoding to base64, how do I solve this?

Code Example

byte[] myBinary = File.ReadAllBytes(path);
using (var reader = new PdfReader(myBinary))
{
    using (var ms = new MemoryStream())
    {
        using (var stamper = new PdfStamper(reader, ms))
        {
            int PageCount = reader.NumberOfPages;
            for (int i = 1; i <= PageCount; i++)
            {
                ColumnText.ShowTextAligned(stamper.GetUnderContent(i),
        Element.ALIGN_CENTER, new Phrase(String.Format("{0} מתוך {1}", i, PageCount)), 297f, 15f, 0);
            }
        }
        myBinary = ms.ToArray();
    }
}
string base64EncodedPDF = System.Convert.ToBase64String(myBinary);
return base64EncodedPDF;

In the front all I do is download the file.

 $scope.open_letter = function (letter) {
   var _letter = myService.PrintLetter().then(function (data) {
       var pdfAsDataUri = "data:application/pdf;base64," + data.data;
       var a = document.createElement("a");
       a.href = pdfAsDataUri;
       a.download = "מכתב" + ".pdf";
       a.click();
});
}

The reason I am asking this question is because in English it works perfectly, but it just ommits the Hebrew letters, which is interesting- I would assume it would replace it with weird characters.


Solution

  • So I finally managed to solve this issue.

    The problem was not that I was missing a font, but I wasn't sending one at all to the new Phrase function.

    I guess it knows what to do with the english letters, but not with the hebrew ones.

    What I did was this:

    BaseFont bf = BaseFont.CreateFont("c:/windows/Fonts/GISHA.ttf", BaseFont.IDENTITY_H, BaseFont.NOT_EMBEDDED);
    Font f= new Font(bf, 8, Font.NORMAL, BaseColor.BLACK);
    

    and then in my loop for page numbers, I did this:

    int PageCount = reader.NumberOfPages;
    for (int i = 1; i <= PageCount; i++)
    {
       ColumnText.ShowTextAligned(stamper.GetUnderContent(i),
       Element.ALIGN_CENTER, new Phrase(String.Format("{1} ךותמ {0}", i, PageCount), f), 297f, 15f, 0);
     }
    

    Which solved my issue, and now it works beautifully.

    proof