  1. Python versus ETL tools : r/dataengineering - Reddit

    Python is a language; trying to do data engineering using open("file.parquet", "r") or, even better, using boto3 to read individual objects would be outrageously complex and idiotic. It's the tools/libraries you …

  2. Python and ETL : r/dataengineering - Reddit

    Nov 4, 2021 · Python is rarely a good choice for ETL. With modern data warehouses like Snowflake, you can write a few lines of SQL and let the query optimizer do its work: you don't have to worry …

  3. Python over SSIS for ETL : r/BusinessIntelligence - Reddit

    Oct 18, 2023 · I keep seeing this craze of using Python to ingest data and also for ETL over something like SSIS. To me, SSIS is hugely performant and can be used by non-programming people. What am …

  4. ETL using pure python (no Pandas) : r/dataengineering - Reddit

    Jan 23, 2023 · I’ve heard that using pure Python is a good step before using Spark, but I can’t find an example of ETL using just pure Python. How would that work? Is there an example script for an ETL …

  5. How can I schedule python ETL code? : r/dataengineering - Reddit

    Jun 29, 2023 · But I feel like if you're looking for something Python-based with a nice UI, check out Dagster or Prefect. Dagster bills itself as more data ETL oriented, while Prefect seems to be more …

  6. Best place to learn Python for ETL? : r/learnpython - Reddit

    Dec 22, 2022 · Python is versatile enough to ETL a lot of ways with a lot of database types. I would focus on learning about ETL itself, then circle back to Python. A lot of packages are out there to help …

  7. Best way to ETL : r/dataengineering - Reddit

    Dec 28, 2022 · Best way to ETL is to have the consumers talk to the producers and have them build the transformation and modelling logic at the source. Then you can build dumb pipes to move the data to …

  8. The expert way of structuring a project for Python ETL. : r/ETL

    Aug 29, 2019 · The expert way of structuring a project for Python ETL. Hi, I have recently moved from an Informatica-based ETL project to Python/PySpark-based ETL. I want to know the best way to …

  9. Python or SQL for ETL? Or both? : r/dataengineering - Reddit

    In fact, using Python to trigger SQL scripts as part of a larger ETL process is very common - maybe even the most common pattern at this point. One way of using Python to trigger other processes as …

  10. Where to start with Python for data pipelines? - Reddit

    Aug 26, 2023 · Without knowing the specifics about your data sources, I'm inclined to say that you can build an ETL workflow using Python (pandas is your friend). With that said, I'm a fan of the Microsoft …
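
Two of the threads above touch on what ETL in plain Python looks like (result 4) and on Python triggering SQL as part of a larger process (result 9). The following is only a minimal sketch of that idea, not taken from any of the linked threads. It assumes a tiny in-memory CSV sample and an in-memory SQLite database; the names `RAW_CSV`, `extract`, `transform`, `load`, `run_pipeline`, and the `orders` table are all hypothetical and chosen for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical sample input; a real pipeline would read a file, API, or queue.
RAW_CSV = """order_id,customer,amount
1,alice,10.50
2,bob,
3,alice,4.25
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing amount and cast column types."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["customer"], float(row["amount"]))
        )
    return cleaned


def load(conn, records):
    """Load: write the cleaned records into a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", records)
    conn.commit()


def run_pipeline(conn, text=RAW_CSV):
    """Run extract, transform, load, then let SQL do the aggregation."""
    load(conn, transform(extract(text)))
    # Python only triggers the query; the database engine does the grouping.
    query = (
        "SELECT customer, SUM(amount) FROM orders "
        "GROUP BY customer ORDER BY customer"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(run_pipeline(connection))
    connection.close()
```

The split into three small functions keeps each stage testable on its own, and swapping SQLite for a warehouse connection would leave `extract` and `transform` unchanged under these assumptions.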