AdWords is now Google Ads. Our new name reflects the full range of advertising options we offer across Search, Display, YouTube, and more. Learn more

Ads
3.6K members online now
3.6K members online now
For questions related to Google Shopping and Merchant Center. Learn to optimize your Shopping ads
Guide Me
star_border
Reply

Help with Special Characters in Category Taxonomy...

Visitor ✭ ✭ ✭
# 1
Visitor ✭ ✭ ✭

Hi All,

 

To give a bit of background, we using a Latin-1 based platform to import the text version Google Taxonomy into our application to generate the Product Feed:

 

http://www.google.com/basepages/producttype/taxonomy.en-GB.txt

 

The actual taxonomy from google contains special characters, ie:

"Food, Beverages & Tobacco > Beverages > Wine > Rosé Wine"

 

Our system can not handle these "é" characters and the import fails to process these lines.

 

Is anyone else come across this before, or have any alternative versions of this txt file with different encoding of these characters?

 

Thanks

1 Expert replyverified_user

Re: Help with Special Characters in Category Taxonomy...

Participant ✭ ✭ ✭
# 2
Participant ✭ ✭ ✭
Hi Arun,
I've never come across before, but have you tried uploading w/o the é? Maybe substitute an "e" and see if that works?

If that doesn't work, try replacing é with %C3 %A9 (this is encoding that might enable it to work).

Re: Help with Special Characters in Category Taxonomy...

Visitor ✭ ✭ ✭
# 3
Visitor ✭ ✭ ✭
Thank you for the reply.

Basically what we're trying to do is create a script to import the content of the URL above, so that if the taxonomy file changes, the script will always take the latest version for our feed(s).

I can do it manually as you've described, but we're trying to completely automate it but because our platform can't handle UTF-8, so wondering if anyone else has come across this and has worked around a solution? OR found an alternative URL with different character encoding?

Thanks

Re: Help with Special Characters in Category Taxonomy...

[ Edited ]
Top Contributor
# 4
Top Contributor

a google_product_category value must be exactly as google defines.

choices include:
- use only utf-8 for all data submitted;
- use the highest-level taxonomy that categorizes the
physical item but only uses the us-ascii character-set; e.g.
Food, Beverages & Tobacco > Beverages > Wine

generally, encoded character-sets cannot

be mixed within the same submitted data; e.g.

é
é