Platform product includes the Apache Derby database, a JDBC-based relational database system, for use with the reporting feature. The Derby database is a small-footprint, open-source database, and is only appropriate for demo clusters. If you want to use the reporting feature to produce regular reports for a production cluster, you must use a supported commercial database such as Oracle or MySQL.
Data sources are files that store cluster operation and workload information such as host status changes, session, and task status, and so on. product uses several files as data sources. These include daemon status files, and event files.
Data loaders collect the operational data from the data sources and load the data into tables in a relational database. The data loaders connect to the database using a JDBC driver.
The loader controller service (plc) controls the data loaders that collect data from the system, and writes the data into the database.
The data purger service (purger) maintains the size of the database by purging old records from the database and archiving them. By default, the data purger purges all data that is older than 14 days, and purges data every day at 12:30 a.m.
Platform provides a set of out-of-box report templates, called standard reports. These report templates allow you to produce a report to analyze your cluster. The standard reports capture the most common and useful data to analyze your cluster.
You can also create custom reports to perform advanced queries and reports beyond the data produced in the standard reports.