Ok , so you aren't getting it from Patstat online but on datasets that have been supplied possibly on a memory stick. Try Using Encoding="unicode" option in Infile statement (as in this thread) I've copied below from Patstat documentation which describes the data that I think you are using 2.1. Data files You will find the data files in the folder Data. For every table there is one or more zipped data file(s). To keep all files (if unzipped) smaller than 2 GB, the data of the large tables are split into multiple files. Sometimes users report problems unzipping some files when using WinZip. In these cases it is advisable to try a different unpacking tool like the open source tool 7-zip. After unzipping, each file contains one header record comprising the column names of the table, followed by multiple data records. All characters are in Unicode with UTF-8 encoding The files are in MS-DOS format (i.e. each line ends with CR/LF) Each value is delimited by a comma "," Each string / text value is enclosed in double quotes, like: "Smith, John". Within string values there are never double quotes. All non-text values (numbers, dates) are not enclosed in quotes, like: 123, 2014-12-31 The decimal separator is the point ".", e.g. 0.125 Line breaking characters (LF, CR) are replaced by " \n ". They occur most frequently in the abstract text and the NPL bibliographic text. Except "\n", no other characters are escaped.
... View more