
Backup - encode: source UTF-8 encoding declared, but invalid UTF-8 sequence occured

AndiS

New Pleskian
Hi,

The backup migration result shows:
<message severity="warning" id="0272ed7e-a013-456d-8e6d-015ef7205db5" code="msgtext">
<description>Encoding::encode: source UTF-8 encoding declared, but invalid UTF-8 sequence occured. Falling back to ISO-8859-1</description>

for two domains.
The KB article
http://kb.odin.com/en/124608

shows a resolution, but how do I find the files that are stored with the wrong encoding?
No specific file name is listed in the log, only some IDs.

Resolution
Make sure that there are no non-UTF-8 characters in file names in the domain directory /var/www/vhosts/domain.tld/httpdocs. Rename these files and try the backup once again.


Andreas
 
Go to /var/www/vhosts/ and run two commands:

# find . -print | sed 's;[^/]*/;|___;g;s;___|; |;g' > files1.txt
# grep --color='auto' -P -n "[\x80-\xFF]" files1.txt

The first command creates the file files1.txt, which contains the whole directory tree.
The second command highlights in red every non-ASCII character in files1.txt.
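
The grep pattern above looks for any byte outside the ASCII range, so depending on the locale it may also flag names that are already correctly encoded. As a rough alternative, the sketch below reports only the paths that fail to decode as UTF-8. It assumes the iconv utility is installed; the function names is_valid_utf8 and find_invalid_utf8_names are made up for this example and are not part of Plesk. Paths containing a newline would be split across lines, so treat the output as a hint rather than an exact list.

```shell
# Succeeds (exit status 0) only if the argument is a valid UTF-8 byte sequence.
# Assumption: iconv is available and rejects invalid input with a non-zero status.
is_valid_utf8() {
    printf '%s' "$1" | iconv -f UTF-8 -t UTF-8 >/dev/null 2>&1
}

# Prints every path below the given directory (default: current directory)
# that is not valid UTF-8.
find_invalid_utf8_names() {
    find "${1:-.}" -print | while IFS= read -r path; do
        is_valid_utf8 "$path" || printf '%s\n' "$path"
    done
}
```

After defining both functions in a shell session, something like find_invalid_utf8_names /var/www/vhosts/domain.tld/httpdocs should list the candidates to rename.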
 
Hi Igor,

Great. I found a lot of files with German umlauts which are stored in the wrong encoding.

Andreas
 