Jira (PDB-5105) Benchmark console query performance and identify slow queries

1 view
Skip to first unread message

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 10:58:02 AM4/14/21
to puppe...@googlegroups.com
Bogdan Irimie created an issue
 
PuppetDB / Task PDB-5105
Benchmark console query performance and identify slow queries
Issue Type: Task Task
Assignee: Unassigned
Created: 2021/04/14 7:57 AM
Priority: Normal Normal
Reporter: Bogdan Irimie
Add Comment Add Comment
 
This message was sent by Atlassian Jira (v8.13.2#813002-sha1:c495a97)
Atlassian logo

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 10:59:03 AM4/14/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 10:59:03 AM4/14/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 11:00:04 AM4/14/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 11:01:02 AM4/14/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 11:05:03 AM4/14/21
to puppe...@googlegroups.com
Bogdan Irimie updated an issue
We should benchmark the queries gathered in PDB-5043 with a PDB instance that has data for 10, 100, 1000, 10000 and 100000 nodes.

PDB can be field with synthetic data generated by the benchmark tool from PDB.

Bogdan Irimie (Jira)

unread,
Apr 14, 2021, 11:08:01 AM4/14/21
to puppe...@googlegroups.com
Bogdan Irimie updated an issue
Change By: Bogdan Irimie
Acceptance Criteria: A CSV with timing for all queries from PE console that ran against a PDB with data for 10, 100, 1000, 10000, 100000 nodes. 

Bogdan Irimie (Jira)

unread,
Apr 15, 2021, 1:34:02 PM4/15/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 15, 2021, 1:34:03 PM4/15/21
to puppe...@googlegroups.com

Zachary Kent (Jira)

unread,
Apr 15, 2021, 3:16:04 PM4/15/21
to puppe...@googlegroups.com
Zachary Kent commented on Task PDB-5105
 
Re: Benchmark console query performance and identify slow queries

Related to this ticket CD4PE has gathered some of the common PDB queries they use in a Google doc linked to CDPE-4097. That should be another source of possible queries we could help improve.

Bogdan Irimie (Jira)

unread,
Apr 21, 2021, 3:35:02 AM4/21/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 21, 2021, 3:35:03 AM4/21/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Apr 21, 2021, 3:42:04 AM4/21/21
to puppe...@googlegroups.com

Eric Thompson (Jira)

unread,
Apr 30, 2021, 12:24:03 PM4/30/21
to puppe...@googlegroups.com

Eric Thompson (Jira)

unread,
Apr 30, 2021, 2:14:02 PM4/30/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Jun 16, 2021, 3:28:02 AM6/16/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Jun 16, 2021, 3:29:02 AM6/16/21
to puppe...@googlegroups.com

Bogdan Irimie (Jira)

unread,
Jun 16, 2021, 4:49:02 AM6/16/21
to puppe...@googlegroups.com
Bogdan Irimie updated an issue
Change By: Bogdan Irimie
Acceptance Criteria: A One or multiple CSV with timing for all queries from PE console that ran against a PDB with data for 10, 100, 1000, 10000, 100000 nodes. 

Andrei Filipovici (Jira)

unread,
Jun 23, 2021, 6:20:01 AM6/23/21
to puppe...@googlegroups.com

Andrei Filipovici (Jira)

unread,
Jun 28, 2021, 9:46:02 AM6/28/21
to puppe...@googlegroups.com
Andrei Filipovici commented on Task PDB-5105
 
Re: Benchmark console query performance and identify slow queries

To run the tests on vmPooler, you need to install PE, python3 and python3-devel.
Then copy over the load-test folder from the project and run locust

Andrei Filipovici (Jira)

unread,
Jun 28, 2021, 11:53:01 AM6/28/21
to puppe...@googlegroups.com
Andrei Filipovici updated an issue
 
Change By: Andrei Filipovici
We should benchmark the queries gathered in PDB-5043 with a PDB instance that has data for 10, 100, 1000, 10000 and 100000 nodes.

PDB can be field with synthetic data generated by the benchmark tool from PDB.

Bogdan Irimie (Jira)

unread,
Jun 30, 2021, 3:35:03 AM6/30/21
to puppe...@googlegroups.com
Bogdan Irimie updated an issue
Change By: Bogdan Irimie
Sprint: ghost-30.06.2021 , ready for triage 3

Andrei Filipovici (Jira)

unread,
Jun 30, 2021, 10:03:14 AM6/30/21
to puppe...@googlegroups.com
Andrei Filipovici updated an issue
Change By: Andrei Filipovici
Attachment: Screenshot 2021-06-30 at 16.53.07.png

Andrei Filipovici (Jira)

unread,
Jun 30, 2021, 10:09:04 AM6/30/21
to puppe...@googlegroups.com
 
Re: Benchmark console query performance and identify slow queries

I added the 3 sandboxes (1000, 10.000, 100.000) to pdb-n1-prod.ops.puppetlabs.net, at: /home/andrei.filipovici/sandboxes

I added in the database section of pdb.ini, in all sandboxes the following settings:
node-ttl = 0s
node-purge-ttl = 0s
report-ttl = 0s
resource-events-ttl = 0s

Ingestion time for the 10.000 nodes sandbox was a little over half an hour and for the 100.000 nodes was roughly 8 and a half hours.
Maximum JVM heap size for the 100.000 nodes sandbox was 10Gb.
In the attachment is a print screen for the 100.000 nodes sandbox after it finished ingestion.

Andrei Filipovici (Jira)

unread,
Jul 1, 2021, 11:02:03 AM7/1/21
to puppe...@googlegroups.com
Andrei Filipovici updated an issue
Change By: Andrei Filipovici
Attachment: Screenshot 2021-06-30 at 16.53.07.png

Andrei Filipovici (Jira)

unread,
Jul 1, 2021, 11:05:02 AM7/1/21
to puppe...@googlegroups.com
Andrei Filipovici updated an issue
Change By: Andrei Filipovici
Attachment: Screenshot 2021-07-01 at 09.09.04.png
Attachment: Screenshot 2021-06-30 at 21.50.12.png
Attachment: Screenshot 2021-06-30 at 21.46.48.png
Attachment: Screenshot 2021-06-30 at 21.46.15.png

Rob Browning (Jira)

unread,
Jul 12, 2021, 8:02:03 PM7/12/21
to puppe...@googlegroups.com
Rob Browning updated an issue
Change By: Rob Browning
Fix Version/s: PDB 6.18.0
Fix Version/s: PDB 7.5.0
Reply all
Reply to author
Forward
0 new messages