

[✓] Avoid importing existing data as missing data?

Posted 1 year ago

Hello, Community

I imported data from a CSV file using the following code:

data = SemanticImport[ "C:\\Users\\Thadeu\\Documents\\Kaggle\\train.csv"]

When I inspect the imported data, I see missing values in the Dataset (in house$size, for example)

that are not missing from the original data (this affects both house$size and nbaths).

What's going on? Is it possible to fix it?

Thanks, Thad

2 Replies

SemanticImport has probably decided that this is an integer column based on the first line or the first few lines, so it won't import a floating-point number like 1.5 because it doesn't fit that column's data type. Try specifying the column types manually.
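For anyone finding this later, here is a minimal sketch of what specifying the types could look like. It assumes the column headers are exactly house$size and nbaths as written in the question, and that "Real" is the right type for both. That is a guess based on the 1.5 example, not something confirmed against the actual file.

```mathematica
(* Sketch only: the column names come from the question, and "Real" is an
   assumed type for columns that may hold values such as 1.5 *)
file = "C:\\Users\\Thadeu\\Documents\\Kaggle\\train.csv";

(* The association form imports only the listed columns, each with the given type *)
data = SemanticImport[file, <|"house$size" -> "Real", "nbaths" -> "Real"|>]

(* Count the entries that still came through as Missing in one column *)
Count[Normal[data[All, "house$size"]], _Missing]
```

Note that the association form brings in only the columns you name. To keep every column, SemanticImport also accepts a list with one type per column, where Automatic leaves the type detection on for the columns you don't want to override.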

Thanks, Szabolcs

I specified column types manually and it worked fine.


