Glue crawler names columns col_0, col_1 instead of recognizing the header

https://stackoverflow.com/questions/54373335/aws-glue-crawler-cannot-extract-csv-headers

Quick takeaway

Glue's header detection is fragile. Make sure the column names are valid SQL names (e.g. no spaces) and that there are no empty column names (this happens often when exporting from Excel).
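As a rough illustration of the takeaway, here is a minimal Python sketch that cleans a header row before the file is uploaded for crawling. The function name `sanitize_header` and the naming rules (lowercase, underscores, `column_N` for blanks, a `c_` prefix for names starting with a digit) are my own assumptions, not anything Glue prescribes, so adjust them to your own conventions.

```python
import re


def sanitize_header(names):
    """Return non-empty, SQL-friendly column names (hypothetical helper)."""
    cleaned = []
    seen = {}
    for index, name in enumerate(names):
        # Collapse anything that is not a letter, digit or underscore into "_".
        candidate = re.sub(r"[^0-9a-zA-Z_]+", "_", name.strip()).strip("_").lower()
        if not candidate:
            # Empty header cells are common in Excel exports.
            candidate = f"column_{index}"
        if candidate[0].isdigit():
            # Avoid names that start with a digit.
            candidate = f"c_{candidate}"
        # Add a numeric suffix to repeated names (a simple scheme that does
        # not guard against a suffixed name already existing in the header).
        count = seen.get(candidate, 0)
        seen[candidate] = count + 1
        cleaned.append(candidate if count == 0 else f"{candidate}_{count}")
    return cleaned


print(sanitize_header(["First Name", "", "2020 total", "First Name"]))
# ['first_name', 'column_1', 'c_2020_total', 'first_name_1']
```

Run it on the first row of the CSV and write the file back with the cleaned names before pointing the crawler at it.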

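I was having the same issue, where Glue does not recognize the header row when all columns are strings. The built-in CSV classifier only treats the first row as a header if it looks sufficiently different from the data rows, and when every column is a string it cannot tell them apart.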

I found that adding a new column at the end with an integer value solves the problem:

id,name,extra_column
sdf13,dog,1
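If you would rather not edit files by hand, the extra-integer-column workaround can be scripted. This is only a sketch using Python's standard `csv` module; the helper name `add_integer_column` and the constant value `1` are assumptions chosen to match the sample above.

```python
import csv
import io


def add_integer_column(csv_text, column_name="extra_column", value=1):
    """Append an integer column to every row of a CSV string (hypothetical helper)."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return csv_text
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    # The header row gets the new column name.
    writer.writerow(rows[0] + [column_name])
    # Every data row gets the same integer value.
    for row in rows[1:]:
        writer.writerow(row + [value])
    return out.getvalue()


print(add_integer_column("id,name\nsdf13,dog\n"), end="")
# id,name,extra_column
# sdf13,dog,1
```

The extra column can be ignored or dropped in downstream queries once the crawler has picked up the real header names.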
