Data Analysis Dashboard: Difference between revisions
No edit summary |
|||
| Line 13: | Line 13: | ||
<p style="line-height: 1.5em"> | <p style="line-height: 1.5em"> | ||
<strong>Data Upload and Preprocessing:</strong> | |||
*Upload datasets via file input or URL. | *Upload datasets via file input or URL. | ||
*Manage missing values using various imputation methods. | *Manage missing values using various imputation methods. | ||
*Encode categorical variables and coerce numeric data. | *Encode categorical variables and coerce numeric data. | ||
<strong>Correlation Analysis:</strong> | |||
*Compute Pearson and Spearman correlations, as well as Euclidean similarity. | *Compute Pearson and Spearman correlations, as well as Euclidean similarity. | ||
*User-defined thresholds for filtering significant relationships. | *User-defined thresholds for filtering significant relationships. | ||
| Line 25: | Line 25: | ||
*Define relationships using user-specified thresholds. | *Define relationships using user-specified thresholds. | ||
<strong>Inference Rules:</strong> | |||
*Input custom IF-THEN rules to add inferred relationships to the RDF graph. | *Input custom IF-THEN rules to add inferred relationships to the RDF graph. | ||
<strong>Visualization:</strong> | |||
*Visualize correlations using interactive Plotly subplots. | *Visualize correlations using interactive Plotly subplots. | ||
*Display the knowledge graph as a network with customizable aesthetics.'' | *Display the knowledge graph as a network with customizable aesthetics.'' | ||
<strong>SPARQL Querying:</strong> | |||
*Query the RDF graph using SPARQL with a user-friendly interface. | *Query the RDF graph using SPARQL with a user-friendly interface. | ||
Revision as of 05:36, 20 October 2025
Dashboard that allows human operators monitor and extract knowledge from tabular data through visualization/interpretability, querying, and inference features.

Asset Description
The Data Analysis Dashboard facilitates the comprehensive monitoring of tabular data (e.g., from sensor arrays) with sophisticated processing and analysis capabilities. Its key benefits include intuitive visualisation, flexible querying, and the ability to infer patterns and detect anomalies, giving human operators critical decision-making support. It uses some classical Data and Knowledge Engineering methods (e.g., Knowledge Graphs) and is implemented based on the Streamlit framework.
Key Features
Data Upload and Preprocessing:
- Upload datasets via file input or URL.
- Manage missing values using various imputation methods.
- Encode categorical variables and coerce numeric data.
Correlation Analysis:
- Compute Pearson and Spearman correlations, as well as Euclidean similarity.
- User-defined thresholds for filtering significant relationships.
- Knowledge Graph Creation:
- Automatically generate RDF graphs representing significant correlations.
- Define relationships using user-specified thresholds.
Inference Rules:
- Input custom IF-THEN rules to add inferred relationships to the RDF graph.
Visualization:
- Visualize correlations using interactive Plotly subplots.
- Display the knowledge graph as a network with customizable aesthetics.
SPARQL Querying:
- Query the RDF graph using SPARQL with a user-friendly interface.
Usage
This software helps users analyse datasets and uncover hidden relationships. Initially the user should perform the following:
- Upload a CSV file containing the data to be analysed
- Configure the data preprocessing aspects (handling missing values and data types)
Afterwards the user can explore their data and underlying relations , extracting knowledge in a flexible way:
- Select pairs of features and see the visualisations of correlation analysis

- Build a knowledge graph that represents significant relationships in the data
- Interact directly with the knowledge graph through a SPARQL query interface

Note: The asset is under Ongoing Development
Licence
This project is licensed under the MIT License.
Resources
- A live demo of this component can be found here
- The source code is available at the following GitHub repository
- Created by Software Competence Center Hagenberg - SCCH
- Contact jorge.martinez-gil@scch.at
Acknowledgement
This tool has been mainly developed in the frame of the project TrineFlex from the European Union’s Horizon Europe research and innovation programme under Grant Agreement No 101058174.