Understanding Intuitive for PaperCut Datasets

This guide explains the datasets that support the Intuitive for PaperCut dashboard solution. It is important that you read it before attempting to edit existing components or create new ones for the solution.

The Data Integrations feature provided by PaperCut outputs raw data from the database on a daily basis, including a list of all print jobs. The Intuitive for PaperCut solution aggregates this data over time to allow trend analysis over one or more years. Aggregation is an important step: it reduces the amount of storage required and allows faster processing in the dashboard interface, which ultimately means better performance for end users.
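
As a rough illustration of why aggregation reduces storage, the sketch below rolls daily job records up into one row per month, device and department. The record layout, the sample values and the function name `aggregate_by_month` are assumptions made for this example. Only the field names `date`, `printer_name`, `department`, `total_pages` and `usage_cost` come from the PaperCut column of the field table in this guide, and the aggregation actually performed by the solution may differ.

```python
from collections import defaultdict
from datetime import date


def aggregate_by_month(jobs):
    """Roll daily job records up to one row per (month, device, department)."""
    totals = defaultdict(lambda: {"total_pages": 0, "usage_cost": 0.0, "jobs": 0})
    for job in jobs:
        month = job["date"].strftime("%Y-%m")  # e.g. "2024-03"
        key = (month, job["printer_name"], job["department"])
        totals[key]["total_pages"] += job["total_pages"]
        totals[key]["usage_cost"] += job["usage_cost"]
        totals[key]["jobs"] += 1
    return dict(totals)


# Hypothetical sample records (not real PaperCut output)
jobs = [
    {"date": date(2024, 3, 1), "printer_name": "MFP-01", "department": "Finance",
     "total_pages": 10, "usage_cost": 0.50},
    {"date": date(2024, 3, 2), "printer_name": "MFP-01", "department": "Finance",
     "total_pages": 5, "usage_cost": 0.25},
    {"date": date(2024, 4, 1), "printer_name": "MFP-01", "department": "Finance",
     "total_pages": 3, "usage_cost": 0.15},
]

monthly = aggregate_by_month(jobs)
```

Three daily records collapse into two monthly rows here. Over a year of real print jobs the reduction would typically be far larger.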

The solution uses four distinct datasets, listed below:

  • Aggregated Dataset
  • List of Active Users
  • Print Usage
  • User Analysis

The Aggregated Dataset contains a rolling record of printing activity covering the last 12 months plus the current month. The dataset shows the origin of printing by device and also breaks down usage by department. It does not contain any document titles or user data. It is, however, linked to the user datasets to enable lookups, e.g. which users use a certain device.
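
The rolling window can be sketched as a simple month comparison. This example assumes that "12 months plus current month" means the current calendar month and the 12 calendar months before it. The helper names `month_index` and `in_rolling_window` are hypothetical and are not part of the solution.

```python
from datetime import date


def month_index(d):
    """Number of whole months since year 0, used to compare calendar months."""
    return d.year * 12 + (d.month - 1)


def in_rolling_window(record_date, today, months_back=12):
    """True if record_date falls in the current month or the previous months_back months."""
    age_in_months = month_index(today) - month_index(record_date)
    return 0 <= age_in_months <= months_back


today = date(2024, 5, 15)
kept = in_rolling_window(date(2023, 5, 1), today)      # 12 months old: kept
dropped = in_rolling_window(date(2023, 4, 30), today)  # 13 months old: dropped
```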

The User Analysis dataset is used to identify the heaviest users of print and copy within an organization. Each month we keep a running total of pages printed and, at the end of the month, calculate the top 100 users. The name of each user in the top 100 and their total for that month are recorded in the dataset. The dataset also records the devices these users used and the type of activity.
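
The monthly top-100 calculation can be sketched as a running total per user followed by a ranking. This is a minimal example that assumes the input is one month of job records. The function name `top_users` and the sample data are hypothetical, and the field names `used_by_user_id`, `full_name` and `total_pages` are taken from the PaperCut column of the field table in this guide.

```python
from collections import Counter


def top_users(month_jobs, n=100):
    """Return the n users with the most pages in one month, highest first."""
    totals = Counter()
    for job in month_jobs:
        # Key on both ID and name so the name can be recorded with the total
        totals[(job["used_by_user_id"], job["full_name"])] += job["total_pages"]
    return totals.most_common(n)


# Hypothetical sample records for a single month
month_jobs = [
    {"used_by_user_id": "u1", "full_name": "Alice Smith", "total_pages": 30},
    {"used_by_user_id": "u2", "full_name": "Bob Jones", "total_pages": 40},
    {"used_by_user_id": "u1", "full_name": "Alice Smith", "total_pages": 20},
    {"used_by_user_id": "u3", "full_name": "Carol White", "total_pages": 5},
]

top_two = top_users(month_jobs, n=2)
```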

The Print Usage dataset contains a record of all print, copy and scan jobs in the last 30 days. Each job has details of the user that generated it, the activity type and, where appropriate, the document title. This dataset enables you to audit the activity of any user on the system over the last 30 days.
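
An audit of a single user over the 30-day window might look like the sketch below. The function name `audit_user`, the inclusive cutoff and the sample records are assumptions made for illustration. The field names `used_by_user_id`, `date`, `job_type` and `document_name` are taken from the PaperCut column of the field table in this guide.

```python
from datetime import date, timedelta


def audit_user(jobs, user_id, today, days=30):
    """Return the jobs generated by user_id within the last `days` days."""
    cutoff = today - timedelta(days=days)  # assumption: cutoff day is included
    return [
        job for job in jobs
        if job["used_by_user_id"] == user_id and cutoff <= job["date"] <= today
    ]


# Hypothetical sample records
jobs = [
    {"used_by_user_id": "u1", "date": date(2024, 5, 20), "job_type": "PRINT",
     "document_name": "report.pdf"},
    {"used_by_user_id": "u1", "date": date(2024, 4, 15), "job_type": "COPY",
     "document_name": ""},
    {"used_by_user_id": "u2", "date": date(2024, 5, 25), "job_type": "PRINT",
     "document_name": "invoice.docx"},
]

recent = audit_user(jobs, "u1", today=date(2024, 5, 31))
```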

The List of Active Users dataset also contains data from the last 30 days. Unlike the Print Usage dataset, the columns reported are kept to a minimum. This allows us to quickly calculate and display parameters such as:

  • How many users have been active in the last 30 days?
  • What is the average cost of a user?
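
Both questions reduce to a distinct count and a division. The sketch below assumes the rows are already limited to the last 30 days and interprets "average cost of a user" as total cost divided by the number of distinct users, which is an assumption rather than a confirmed definition. The function name `active_user_stats` and the sample rows are hypothetical.

```python
def active_user_stats(rows):
    """Return (number of distinct users, average cost per user)."""
    users = {row["used_by_user_id"] for row in rows}
    total_cost = sum(row["usage_cost"] for row in rows)
    average_cost = total_cost / len(users) if users else 0.0
    return len(users), average_cost


# Hypothetical rows, already restricted to the last 30 days
rows = [
    {"used_by_user_id": "u1", "usage_cost": 1.00},
    {"used_by_user_id": "u1", "usage_cost": 0.50},
    {"used_by_user_id": "u2", "usage_cost": 0.50},
]

user_count, average_cost = active_user_stats(rows)
```

Keeping only the user ID and cost columns is enough for both figures, which matches the point above about keeping the reported columns to a minimum.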

 

The table below lists the fields within each of the datasets, their original names within PaperCut and the dataset(s) each field belongs to:

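Each dataset covers the following period:

  • Aggregated Dataset: aggregated by month (12 months + current)
  • List of Active Users: last 30 days
  • Print Usage: last 30 days
  • User Analysis: aggregated by month, top 100 users (12 months + current)
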
Intuitive Field PaperCut Field Aggregated Dataset List of Active Users Print Usage User Analysis
User ID used_by_user_id

 

User Name full_name

 

Activity job_type

 

Department department

FileType file_extension

 

Date date

 

 

 

Month date

 

Colour Composition gray_scale

 

Duplex Composition gray_scale

 

Cancelled cancelled

 

Printed printed

 

Queue Type Device_type

 

Device Type printer_type

 

Device Name printer_name

 

Device Location printer_location

 

 

 

Physical Printer ID physical_printer_ID

 

 

 

Paper Size paper_size

 

Serial Number serial_number

 

 

 

Denied Reason denied_reason

 

 

 

Printer Group printer_group

 

 

 

Refunded refunded

 

Job ID printer_usage_log_id

 

 

 

Job Title document_name

 

 

 

Printed Pages total_pages printed=true

 

Total Pages% total_pages

 

 

 

Mono Pages total_pages - total_colour_pages

 

 

Colour Pages total_colour_pages

 

Trees total_pages * tree_parameter

 

 

 

CO2 total_pages * CO2_parameter

 

 

 

Electricity total_pages * Electricity_parameter

 

 

 

Print Cost usage_cost

 

 

 

Scan Cost usage_cost

 

 

 

Total Cost usage_cost

 

 

 

Cost per user usage_cost

 

 

Number of users used_by_user_id

 

 

Cost usage_cost

 

Mono Cost usage_cost & gray_scale

 

 

 

Colour Cost usage_cost & gray_scale

 

 

 

Cost % usage_cost

 

 

Cancelled Pages total_pages & cancelled

 

 

 

Cancelled Pages % total_pages & cancelled

 

 

 

Cost Saved usage_cost & cancelled

 

 

 

Cost before Savings usage_cost

 

 

 

Scanned Pages total_pages & job_type

 

 

Total Cost usage_cost

 

 

 

Mono% usage_cost & gray_scale

 

Colour% usage_cost & gray_scale

 

No of Devices serial_number

 

 

 

Simplex % duplex

 

Duplex % duplex

 

No of Jobs printer_usage_log_id

 

No of Jobs%  

 

 

 

Duplex Pages duplex_pages

 

 

Simplex Pages duplex_pages

 

 

Total Pages total_pages

 

 

Total Printed Sheets total_sheets & printed

 

 

 

Saved Cost Saved cost pivoted by month

 

 

 

Activity Total pages pivoted by activity

 

Scanned Pages Scan pages by month
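
Several rows in the table above are computed rather than copied, for example Mono Pages (`total_pages - total_colour_pages`) and Trees (`total_pages * tree_parameter`). The sketch below applies those formulas to a single record. The function name `derived_fields` is hypothetical, and the parameter values used in the example are placeholders chosen for easy arithmetic, not real environmental conversion factors.

```python
def derived_fields(job, tree_parameter, co2_parameter, electricity_parameter):
    """Compute the derived page and environmental fields for one record."""
    total = job["total_pages"]
    colour = job["total_colour_pages"]
    return {
        "Mono Pages": total - colour,
        "Colour Pages": colour,
        "Trees": total * tree_parameter,
        "CO2": total * co2_parameter,
        "Electricity": total * electricity_parameter,
    }


# Placeholder parameters for illustration only
example = derived_fields(
    {"total_pages": 100, "total_colour_pages": 30},
    tree_parameter=0.5,
    co2_parameter=2.0,
    electricity_parameter=0.25,
)
```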