Glue crawler col_0, col_1 not recognizing header

https://stackoverflow.com/questions/54373335/aws-glue-crawler-cannot-extract-csv-headers

Quick takeaway

Glue header identifier is fragile. Make sure the column names are valid SQL names (i.e. no spaces) and there’re no empty column names (happens often when exporting from excel)

I was having the same issue where Glue does not recognize the header row when all columns are Strings

I found that adding a new column on the end with an integer solves the problem

id,name,extra_column sdf13,dog,1

Leave a comment