Jira (PDB-3186) Allow limits in anonymized exports

2 views
Skip to first unread message

Ryan Senior (JIRA)

unread,
Nov 15, 2016, 11:27:22 AM11/15/16
to puppe...@googlegroups.com
Ryan Senior created an issue
 
PuppetDB / New Feature PDB-3186
Allow limits in anonymized exports
Issue Type: New Feature New Feature
Assignee: Unassigned
Created: 2016/11/15 8:26 AM
Labels: maintenance
Priority: Normal Normal
Reporter: Ryan Senior

The current user experience of an anonymized export of a PuppetDB database is pretty bad. We currently export all of the data from PDB instance (anonymizing as we go) and that is returned via HTTP as a tar.gz file. For PDB databases of any decent size, this takes a very long time and can be very large (i.e. 20+ GB). This makes it a time-consuming and difficult process.

The reason this process takes so long is because we are exporting all reports. There is value in having reports, but there's not a lot of value in having every report for every node.

We should think through what changes we can make to limit the export/anonymization of reports and still get similar value. The result of this ticket should be a set of tickets with info on what we should change. Some suggestions:

  • Export only reports that have changes
  • Allow exporting only a given number of nodes worth of data
  • Change the benchmark tool to synthetically create unchanged reports from a list of only changed reports
Add Comment Add Comment
 
This message was sent by Atlassian JIRA (v6.4.14#64029-sha1:ae256fe)
Atlassian logo

Ryan Senior (JIRA)

unread,
Nov 28, 2016, 11:58:02 AM11/28/16
to puppe...@googlegroups.com
Ryan Senior commented on New Feature PDB-3186
 
Re: Allow limits in anonymized exports

We should also change the order in which things are exports, catalogs then facts then reports

Ryan Senior (JIRA)

unread,
Nov 28, 2016, 11:59:03 AM11/28/16
to puppe...@googlegroups.com
Ryan Senior updated an issue
 
Change By: Ryan Senior
Sprint: Hopper

Ryan Senior (JIRA)

unread,
Nov 28, 2016, 11:59:03 AM11/28/16
to puppe...@googlegroups.com
Ryan Senior updated an issue
Change By: Ryan Senior
Story Points: 1 3

Ryan Senior (JIRA)

unread,
Mar 9, 2017, 4:39:11 PM3/9/17
to puppe...@googlegroups.com
Ryan Senior updated an issue
Change By: Ryan Senior
Sprint: Hopper

Ryan Senior (JIRA)

unread,
Mar 22, 2017, 7:05:02 PM3/22/17
to puppe...@googlegroups.com

Russell Mull (JIRA)

unread,
Jun 14, 2017, 12:21:37 PM6/14/17
to puppe...@googlegroups.com
Russell Mull updated an issue
Change By: Russell Mull
Sprint: Hopper

Claudia Petty (Jira)

unread,
Jun 21, 2023, 10:56:08 AM6/21/23
to puppe...@googlegroups.com
Claudia Petty updated an issue
Change By: Claudia Petty
Labels: maintenance new-feature
This message was sent by Atlassian Jira (v8.20.21#820021-sha1:38274c8)
Atlassian logo
Reply all
Reply to author
Forward
0 new messages