Re: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager

in reply to Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager

Postgres nor a DBI dbhandle do threads (as 1nickt already implied).

I don't think doing this kind of parallel access is going to help all that much to get more out of the DB. (But it might be useful for testing, I suppose.)

And it seems to me you're forgetting the hard part: UPDATE. Do you not need that?

Comment on Re: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager

Replies are listed 'Best First'.
Re^2: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager by perlygapes (Sexton) on Apr 14, 2020 at 00:55 UTC
Thanks erix. I do not need to update, actually. All I need to do is check if an identical record already exists and INSERT if not, else move onto the next 'candidate record' to insert. I will have multiple CPUs all generating candidates for insertion into the table but the records must be unique. I know Postgres will tell me if I try to create a duplicate record on a field I have set to UNIQUE, but I am not sure exactly how to check for that in Perl and make sure the script continues rather than dies. Incorporating the 'dbh-per-thread/process' idea, the pseudo-code would go something like: `foreach $CPU (@CPUs) { create dbh for this CPU; prepare_cached SELECT; prepare_cached INSERT; foreach combination { # criterion have affinity with CPU core # so criterion is always unique to CPU generate row-data; # check if row already exists; # what the SELECT sth is for if not then { insert new row; # what the INSERT sth is for } } }` [download]	[reply] [d/l]
Re^3: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager by 1nickt (Canon) on Apr 14, 2020 at 03:19 UTC
Hi again, maybe what you need is an no-op "upsert", implemented in Postgres with `INSERT ON CONFLICT` ? Something like `INSERT INTO myTable (foo, bar) VALUES ('baz', 'qux') ON CONFLICT (foo) DO NOTHING;` [download] ... where `foo` is your column that has a unique key constraint. Hope this helps! The way forward always starts with a minimal test.	[reply] [d/l] [select]
Re^4: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager by perlygapes (Sexton) on Apr 30, 2020 at 13:53 UTC
Thank you for your suggestion. I realise I probably did not explain fully enough my goal. Let's say I have 10 columns in my table (excluding the PK). I only want to store a data row into the table if the combination of all 10 values is a unique combination set. Example with a 3 column table: `INSERT Row 1: A,A,A <== OK! Value set unique INSERT Row 2: A,D,A <== OK! Value set unique INSERT Row 3: D,T,G <== OK! Value set unique INSERT Row 4: D,A,A <== OK! Value set unique INSERT Row 5: A,D,A <== COLLISION with Row 2! Set not unique; skip IN +SERT` [download]	[reply] [d/l]
Re^4: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager by perlygapes (Sexton) on May 08, 2020 at 05:53 UTC
Thanks. I think I understand what that does, but the unique requirement is not on just one field, it is on the whole record. That is, the collection of all field values comprises a UNIQUE combination, and there must not be any other rows with that exact same combination of values.	[reply]
Re^5: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager by 1nickt (Canon) on May 08, 2020 at 16:33 UTC
Re^6: Create parallel database handles or SQL statements for multi-threaded/process access to Postgres DB using DBI, DBD::Pg and Parallel::ForkManager by erix (Prior) on May 08, 2020 at 18:52 UTC

In Section Seekers of Perl Wisdom