•1 min read•from Machine Learning
Built an AI tool that cleans datasets, fills missing values, and predicts unknown fields [P]
![Built an AI tool that cleans datasets, fills missing values, and predicts unknown fields [P]](/_next/image?url=https%3A%2F%2Fpreview.redd.it%2Fzg2tjekil0vg1.png%3Fwidth%3D140%26height%3D65%26auto%3Dwebp%26s%3D5de652e8e2ad181017cf754ef071f99bd83eab83&w=3840&q=75)
| I built a Streamlit-based AI data analysis tool that: • Fills missing values using ML models (not just mean/median) • Predicts any missing column using n-1 inputs • Detects anomalies • Shows correlations and feature importance • Lets you download the updated dataset (Attached images show the UI and before vs after CSV file with a sample CSV available on the GitHub page, as well as an image showing the achieved performance metrics) I wanted to test how well it works on real-world incomplete datasets. Would love feedback on: - model approach - accuracy issues - any improvements I should make GitHub: https://github.com/WALKER00058/ML-data-analysis/tree/main [link] [comments] |
Want to read more?
Check out the full article on the original site
Tagged with
#generative AI for data analysis
#conversational data analysis
#Excel alternatives for data analysis
#data analysis tools
#rows.com
#real-time data collaboration
#big data performance
#big data management in spreadsheets
#intelligent data visualization
#data visualization tools
#enterprise data management
#data cleaning solutions
#large dataset processing
#cloud-based spreadsheet applications
#financial modeling with spreadsheets
#real-time collaboration
#AI tool
#data analysis
#datasets
#missing values