Linked Questions
18 questions linked to/from "Bulk insert a Pandas DataFrame using SQLAlchemy"
53 votes · 6 answers · 127k views
How to speed up bulk insert to MS SQL Server using pyodbc
Below is my code that I'd like some help with.
I have to run it over 1,300,000 rows, and it takes up to 40 minutes to insert ~300,000 rows.
I figure bulk insert is the route to go to speed ...
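Answers to that question commonly point to pyodbc's fast_executemany flag, which batches parameters instead of sending one round trip per row. A minimal sketch, where the connection string, table and columns are hypothetical placeholders:

import pyodbc

# hypothetical connection string; replace with the real server/database
conn = pyodbc.connect(
    "DRIVER={ODBC Driver 17 for SQL Server};"
    "SERVER=my_server;DATABASE=my_db;UID=user;PWD=password"
)
cursor = conn.cursor()
cursor.fast_executemany = True   # batch the bound parameters instead of one round trip per row

rows = [(1, "alice"), (2, "bob")]  # stand-in data for the real 1,300,000 rows
cursor.executemany("INSERT INTO my_table (id, name) VALUES (?, ?)", rows)
conn.commit()
conn.close()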
15 votes · 4 answers · 31k views
Speed up to_sql() when writing Pandas DataFrame to Oracle database using SqlAlchemy and cx_Oracle
Using the pandas DataFrame.to_sql method, I can write a small number of rows to a table in an Oracle database pretty easily:
from sqlalchemy import create_engine
import cx_Oracle
dsn_tns = "(DESCRIPTION=(...
19 votes · 2 answers · 26k views
Saving a Pandas DataFrame to a Django Model
I have stock price data that is stored in a pandas DataFrame as shown below (actually it was in a panel, but I converted it to a DataFrame)
   date        ticker  close  tsr
0  2013-03-28  abc     ...
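For the Django question, the usual direction is to build model instances from the frame's rows and insert them with bulk_create. A minimal sketch to run inside a Django context (e.g. a management command), where StockPrice and its fields are hypothetical stand-ins for the asker's model and the values are example data:

import pandas as pd
from myapp.models import StockPrice  # hypothetical model with date/ticker/close/tsr fields

df = pd.DataFrame(
    {"date": ["2013-03-28"], "ticker": ["abc"], "close": [1.0], "tsr": [0.0]}  # example values
)

objs = [
    StockPrice(date=row.date, ticker=row.ticker, close=row.close, tsr=row.tsr)
    for row in df.itertuples(index=False)
]
StockPrice.objects.bulk_create(objs, batch_size=1000)  # one INSERT per batch instead of per row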
9 votes · 3 answers · 20k views
Speeding up Pandas to_sql()?
I have a 1,000,000 x 50 Pandas DataFrame that I am currently writing to a SQL table using:
df.to_sql('my_table', con, index=False)
It takes an incredibly long time. I've seen various explanations ...
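Answers there usually mention the to_sql parameters added for exactly this situation: method="multi" and chunksize. A sketch with a placeholder Postgres connection string and table name:

import pandas as pd
from sqlalchemy import create_engine

engine = create_engine("postgresql+psycopg2://user:password@localhost/my_db")  # hypothetical
df = pd.DataFrame({"a": range(1000), "b": range(1000)})                        # stand-in data

df.to_sql(
    "my_table",
    engine,
    index=False,
    if_exists="append",
    method="multi",   # pack many rows into each INSERT statement
    chunksize=1000,   # keep each statement's parameter count manageable
)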
3 votes · 2 answers · 8k views
Improve pandas' to_sql() performance with SQL Server
I'm writing because I cannot fix an issue with the pandas.DataFrame.to_sql() method.
I've made the connection between my script and my database and I can send queries, but it's too slow for me.
I ...
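For SQL Server specifically, the usual suggestion is to turn on fast_executemany at the engine level (SQLAlchemy 1.3+ with pyodbc), so that to_sql's inserts go through pyodbc's batched executemany. A sketch with placeholder server, database and table names:

import pandas as pd
from sqlalchemy import create_engine

# hypothetical server/database; fast_executemany needs SQLAlchemy 1.3+ and the pyodbc dialect
engine = create_engine(
    "mssql+pyodbc://user:password@my_server/my_db?driver=ODBC+Driver+17+for+SQL+Server",
    fast_executemany=True,   # batch parameters instead of one round trip per row
)

df = pd.DataFrame({"a": range(1000), "b": range(1000)})   # stand-in data
df.to_sql("my_table", engine, index=False, if_exists="append")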
4 votes · 2 answers · 5k views
psycopg2.ProgrammingError: syntax error at or near "stdin" error when trying to copy_from redshift
I am having this problem when I am trying to copy to AWS Redshift.
This is the code I am trying to run:
with open('path/to/files', 'rb') as fo:
    cursor.copy_from(fo, 'schema.table', sep=',')
...
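The error in that question comes from Redshift itself: its COPY command cannot read FROM STDIN the way Postgres can, so psycopg2's copy_from does not apply, and the usual route is to stage the file in S3 and COPY from there. A hedged sketch, where the cluster endpoint, credentials, bucket, schema/table and IAM role ARN are all placeholders:

import psycopg2

# placeholder cluster endpoint, credentials, bucket and IAM role ARN
conn = psycopg2.connect(
    "dbname=my_db user=user password=password "
    "host=my-cluster.abc123.us-east-1.redshift.amazonaws.com port=5439"
)
cur = conn.cursor()
cur.execute(
    """
    COPY schema_name.table_name
    FROM 's3://my-bucket/path/to/file.csv'
    IAM_ROLE 'arn:aws:iam::123456789012:role/my-redshift-copy-role'
    CSV
    """
)
conn.commit()
conn.close()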
2 votes · 2 answers · 6k views
Read csv from url one line at the time in Python 3.X
I have to read an online CSV file into a Postgres database, and in that context I have some problems reading the online CSV file properly.
If I just import the file it reads as bytes, so I have to ...
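The bytes problem in that question is usually solved by wrapping the HTTP response in a text wrapper so csv.reader sees decoded lines one at a time. A small sketch with a placeholder URL:

import csv
import io
import urllib.request

url = "https://example.com/data.csv"  # hypothetical URL
with urllib.request.urlopen(url) as response:
    text_stream = io.TextIOWrapper(response, encoding="utf-8")  # decode the byte stream lazily
    reader = csv.reader(text_stream)
    for row in reader:          # rows arrive as lists of strings, one line at a time
        print(row)              # e.g. insert into Postgres here instead of printing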
4 votes · 1 answer · 4k views
How to Insert Huge Pandas Dataframe in MySQL table with Parallel Insert Statement?
I am working on a project where I have to write a data frame with millions of rows and about 25 columns, mostly of numeric type. I am using the pandas DataFrame.to_sql function to dump the dataframe in ...
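One pattern that comes up for questions like this is splitting the frame and letting several processes each call to_sql on their own chunk with their own engine; whether it actually helps depends heavily on the server. A sketch with a placeholder MySQL connection string and table, assuming the target table already exists:

import numpy as np
import pandas as pd
from multiprocessing import Pool
from sqlalchemy import create_engine

DB_URI = "mysql+pymysql://user:password@localhost/my_db"  # hypothetical connection string

def write_chunk(chunk):
    engine = create_engine(DB_URI)   # one engine per worker process
    chunk.to_sql("my_table", engine, index=False, if_exists="append", chunksize=10_000)
    engine.dispose()

if __name__ == "__main__":
    df = pd.DataFrame(np.random.rand(100_000, 25), columns=[f"col_{i}" for i in range(25)])
    n_workers = 4
    size = -(-len(df) // n_workers)                            # ceiling division
    chunks = [df.iloc[i:i + size] for i in range(0, len(df), size)]
    with Pool(processes=n_workers) as pool:
        pool.map(write_chunk, chunks)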
2 votes · 4 answers · 1k views
Modifying a pandas dataframe with dynamic logic using exec
Let's say I have a script that reads data into a dataframe from a database, runs some logic on that dataframe, and then exports the resulting dataframe into another database table like below. The ...
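A tiny sketch of the exec pattern the question describes, with the dynamic rule given an explicit namespace so the modified frame can be read back out; the rule string here is a made-up example rather than the asker's actual logic:

import pandas as pd

df = pd.DataFrame({"price": [10.0, 20.0], "qty": [3, 4]})

rule = "df['total'] = df['price'] * df['qty']"   # dynamic logic, e.g. loaded from a database
namespace = {"df": df}
exec(rule, {}, namespace)                        # run the rule against the frame

df = namespace["df"]                             # the frame was modified in place
print(df)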
1 vote · 1 answer · 4k views
How to speed up Pandas .to_sql function? [duplicate]
import cx_Oracle
import pandas as pd
from sqlalchemy import create_engine
# credentials
username = "user"
password = "password"
connectStr = "ip:port/service_name"
df = pd.read_csv("data.csv")
# ...
0 votes · 1 answer · 3k views
Bulk inserting in python using SQLAlchemy and Pandas [duplicate]
I am migrating from using pyodbc directly to SQLAlchemy, since that is what is recommended for pandas. I could do a simple executemany(con, df.values.tolist()) to bulk insert all rows from my pandas ...
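The SQLAlchemy-side equivalent of that executemany call is passing a list of parameter dictionaries to a single Core insert. A sketch, where the table definition and DSN are hypothetical:

import pandas as pd
from sqlalchemy import Column, Integer, MetaData, String, Table, create_engine

# hypothetical DSN; fast_executemany needs SQLAlchemy 1.3+ with the mssql+pyodbc dialect
engine = create_engine("mssql+pyodbc://user:password@my_dsn", fast_executemany=True)

metadata = MetaData()
my_table = Table(
    "my_table",
    metadata,
    Column("id", Integer),
    Column("name", String(50)),
)
metadata.create_all(engine)

df = pd.DataFrame({"id": [1, 2], "name": ["alice", "bob"]})      # stand-in data
records = df.astype(object).to_dict(orient="records")           # list of {column: value} dicts

with engine.begin() as conn:
    conn.execute(my_table.insert(), records)                    # executed as a single executemany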
0 votes · 1 answer · 1k views
Bulk insert Pandas Dataframe via SQLalchemy into MS SQL database
I have a dataframe with 300,000 rows and 20 columns, many of them containing text. In Excel format this is 30 to 40 MB.
My target is to write this to the database in under 10 minutes. How can I do ...
1 vote · 1 answer · 843 views
Inserting Excel Data into Postgres with Python
Having a worksheet with ~20,000 rows of data, what is the best approach to insert those into a Postgres database?
The table in which I will insert the data consists of many foreign keys, which means ...
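A sketch of one common approach: read the worksheet with pandas, resolve the foreign keys by merging against the referenced tables, then write with to_sql. The file, table and column names below are hypothetical:

import pandas as pd
from sqlalchemy import create_engine

engine = create_engine("postgresql+psycopg2://user:password@localhost/my_db")  # hypothetical

sheet = pd.read_excel("data.xlsx")   # the ~20,000-row worksheet (placeholder file name)
countries = pd.read_sql("SELECT id AS country_id, name AS country FROM countries", engine)

# replace the natural key from the sheet with the referenced table's id column
resolved = sheet.merge(countries, on="country", how="left").drop(columns=["country"])

resolved.to_sql("measurements", engine, index=False, if_exists="append", chunksize=1000)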
0 votes · 1 answer · 338 views
Time complexity of SQL insert and degrading performance with database size - pyspark parquet files to sql via pandas `to_sql()`
Appending to a SQLite database iteratively with pandas' to_sql() appears to get slower with every iteration.
The dataframe that I am appending within each ...
1 vote · 0 answers · 323 views
Pandas speed up to_sql with sqlalchemy
I'm trying to write 300,000 rows to a PostgreSQL database with pandas.to_sql and SQLAlchemy. The rows contain some JSON, but mainly string columns (~25 columns total).
Current implementation takes ~17 ...
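For Postgres, a COPY-based insertion method along the lines of the callable shown in the pandas documentation is usually the big win over plain to_sql. A sketch, with a placeholder connection string, table and data:

import csv
import io

import pandas as pd
from sqlalchemy import create_engine

def psql_insert_copy(table, conn, keys, data_iter):
    # pandas hands us the SQLAlchemy connection; drop down to the raw psycopg2 cursor
    dbapi_conn = conn.connection
    with dbapi_conn.cursor() as cur:
        buf = io.StringIO()
        csv.writer(buf).writerows(data_iter)   # serialize the batch as CSV in memory
        buf.seek(0)
        columns = ", ".join(f'"{k}"' for k in keys)
        table_name = f"{table.schema}.{table.name}" if table.schema else table.name
        cur.copy_expert(f"COPY {table_name} ({columns}) FROM STDIN WITH CSV", buf)

engine = create_engine("postgresql+psycopg2://user:password@localhost/my_db")   # hypothetical
df = pd.DataFrame({"id": range(1000), "payload": ['{"a": 1}'] * 1000})           # stand-in data
df.to_sql("my_table", engine, index=False, if_exists="append", method=psql_insert_copy)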