this post was submitted on 08 Jul 2023
-10 points (36.1% liked)

Python

6559 readers
15 users here now

Welcome to the Python community on the programming.dev Lemmy instance!

πŸ“… Events

PastNovember 2023

October 2023

July 2023

August 2023

September 2023

🐍 Python project:
πŸ’“ Python Community:
✨ Python Ecosystem:
🌌 Fediverse
Communities
Projects
Feeds

founded 2 years ago
MODERATORS
 

Are you doing data science? Statistics? No?

Then for god's sake don't use pandas, you just look dumb af when you pull several MB of a package just to load csv. If you find yourself doing that, just stop programming and look for another job

Thanks for attention

all 8 comments
sorted by: hot top controversial new old
[–] [email protected] 27 points 2 years ago (1 children)

If you find yourself doing that, just stop programming and look for another job

I don't think that is an appropriate response to someone misunderstanding a package. Just educate them in a kind and respectful manner and they'll learn.

[–] kSPvhmTOlwvMd7Y7E 2 points 2 years ago

fair enough.

[–] [email protected] 6 points 2 years ago (1 children)

For those that don't know, the standard library includes a csv package so you can just import csv.

Documented at https://docs.python.org/3/library/csv.html

[–] [email protected] 5 points 2 years ago

I use DictReader all the time, along with a library to hande the type detection. This is the way to go, especially if you need to process line by line or filter columns and rows out first.

Regardless, I'll avoid pandas wherever I can. It's not something I want in production level code if I can help it.

[–] [email protected] 2 points 2 years ago (1 children)

Wait what, people are loading data from csv in their websites?

[–] [email protected] 2 points 2 years ago

Makes sense if you want to give the user the ability to import data from csv (see django import export package). Beyond importing data from user or another service i dont see other uses, but they do exist.

[–] [email protected] 1 points 2 years ago (1 children)

Pandas is more efficient than Python at operating on large datasets. Can you suggest alternarives?