With the PaperCut MF data inserted into a SQLite database and then aggregated by the datafeeds, we can perform our last step of data manipulation inside of the datasets.
What sets datasets apart is their ability to perform additional calculations and formatting that a normal SQL query may not be able to recreate easily and seamlessly.
We’ll focus today on just the “User Data - 12 Months " dataset as the topics today will apply to the other seven, but after this module we’d implore you to look at them as well. You may have also noticed that each of the naming conventions for the datasets follows the datafeed naming conventions. It is possible to have multiple datasets capturing data from a single feed, but for ease, we have it on a 1-1 basis due to our needs for the Print Management solution, and maintainability.
Going back to the dataset, however, we can split this into three main areas. Our attributes in blue, measures in red / orange and pivots in green.
Starting with the attributes, these are the points of data we want to measure the costs and volumes against.
We can sum up the attributes themselves into three main sub-groups, date formats, null values and flags. The date formats take the raw timestamp of a DD-MM-YYYY hh:mm:ss, and with the various Display Types, formats it into different types, such as D-M-Y to remove the timestamp, and M-Y for when we wish to show data monthly.
You may want to explore the other date display types here too. You can aggregate by quarters, be it calendar or financial, along with month names, date differences and so on. There are other articles on our knowledge base that delve into that in more detail.
Attributes in datasets also allow you to handle NULL or empty fields. In some cases, the NULL fields are by design where we want no data to come through, so the dataset can redefine it in ways the components can use it to drill further into data. One example of this is the “Show All Users” field, where all NULL'ed data is redefined as “Show All Users”.
The “Show All Users” and “Show All” attributes ultimately stem from the same field inside of the datafeed, “NULL_PARAMETER”. Datasets allow us to bring in the same field from the datafeed multiple times, as we may need to redefine that data into many different ways, such as totals and percentages, but we’ll cover that in a moment under the attributes section.
One area where we don’t want to see blank fields is really for the rest of the attributes. In an ideal world, a customers’ PaperCut MF tenant should be fully configured, with devices and users’ fully set up and up to date. However, in some instances the dashboards can highlight where that is not the case. When we receive unintentional NULL / empty fields for the Department, Device Name, Account Name and so on, we display them as <unknown>.
This recategorises the blank fields to have the exact same name, <unknown>. This allows you or the customer to analyse the data later in the dashboards and filter by this parameter to help you better sense check the PaperCut MF data.
Attributes are also what we use for security filtering. When setting up users to have access to the dashboards, they can be given the exact same dashboards as one and another but have user or group security set on them to have an unremovable pre-filter to the dashboards, so they can only see data pertinent to that attribute. For example, if you’re given the requirement that a head of Sales should only see Sales related data, you can use the Department attribute to build a security filter, so they only see their own Department.
For that process to be facilitated, you’d need to edit each of the datasets, expand the filter icon at the top and add a security type to the attribute you wish to filter on. This gives the software the logic needed to fulfil the security requirements, if it knows that the dataset security equals the department, and the users’ security is a string value that exists in that attribute.
In a standard implementation of Intuitive for PaperCut MF in default settings, security has been set up ahead of time on the Department attribute. You may therefore proceed with the rest of the security setup to limit views on Department names, or alternatively the Department attribute can be changed to another attribute. This is commonly changed to the Account Name, as typically the Education sector uses these cost centres (Account Names) over the Department name. The reason being, is that the Department field may just state that a user is “Faculty” or “Student”, where the Account Name goes into the detail of how they may be charging jobs to particular areas in the school.
With the attributes out of the way, we’ll focus on the measures.
Starting with our volumes, let’s take a look at “Printed Pages” and “Printed Color Pages”. If you take a look at either one, the expression we use is a sum of the associated feed volume, as we largely need to work out a sum for that attribute or combination of attributes. Total Sum’s are typically used for overall percentages, ratios or an overall sum.
Another piece of analysis we show on most dashboards surrounds the counts around a device estate and number of users. This is performed by running a count distinct on attributes such as the ID of the printed job, and the device’s serial number. These are figures which will be unique, and they will be ID’s that shouldn’t change over time. Examples of this are the “No. Users” and “No. Printers” measures.
For example, we wouldn’t want to build a count on the device ID, as the device could be removed and readded to PaperCut MF and may be recognised as a new printer inside of that software. Therefore, we count that device based on the serial number, as that should be static throughout the devices’ lifetime.
Scanning across to “Color %”, this is one of the percentages we calculate from the colour page volumes against the total page volumes. Looking at the expression for this calculation, this is a standard percentage calculation of one figure, represented against a total figure, multiplied by 100 to find the percentage.
In this case, we’re not using a total sum as this percentage will be quite flexible for working out the colour percentage in combination with the other attributes in that grid component. If we introduced a total sum here, we’d be working out the colour percentage overall to the organisation, and that figure would be static, regardless of the other attributes in that grid.
Before we move on, percentages in our datasets also have a symbol at the end / a post-fix. In the case of percentages, we can still work out the figure in the expression but add a ‘%’ symbol to the end of any time this calculation appears in components. The same is also true of costs which we’ll cover after this, where we can place a currency notation before the figure.
By default, costs are represented in the PaperCut MF solution. These costs come directly from PaperCut, with the calculation performed at source for that user, printing that job to that device. We, however, only need to perform a simple sum / total sum, and add a prefix for the currency.
Averages are also used in the dataset, and one example is the “AVG Page per Printer” measure. This is an overall average, so takes the total sum of all pages, and divides it by a total count distinct (the overall number of unique objects) of the devices in the estate.
Outside of this module, go through each of the dataset columns and see how each one has been performed, and also how they link to other datasets. Another area of interest is familiarising yourself with how the dataset matches attributes / measures to the datafeeds, and which columns they are using.