i have a primitive short plain-text file (*.txt) containing a narrative text written in geman, with words, sentences, usual punctuation, numbers, and also the EUR currency symbol () in it. i import it with
Import["test.txt", "Text"]
, and then i notice that the resulting string lacks all instances of the Euro symbol. Apparently they got replaced by a space char, in other words: deleted!, blanked!, swallowed!, skipped!, omitted!, not imported!, not properly imported!, gone!, overriden!, etc.
FYI the symbol is a standard key on any geman layout keyboard, it is invoked through pressing <Alt Gr>+<E>, it is also allowed as char in file names (Windows OS) and it cannot be regarded as special char such as: <>|":/?*\ The currency symbol is part of the keyboard alphabet, if you will! At least in the Europe.
Maybe there is a normal explanation and a workaround for the observed behavior, e.g. an option to Import[] or general settings/preferences. Or maybe it could be considered a bug? Because, interestingly, when i import the text file with
Import["test.txt", "HTML"]
, the resulting string does contain all instances of the symbol, as expected. By reporting this observation i am glad to have helped raise public awareness and make the responsibles improve the software :-P
Attachments: