
SIGMOD2019Presentation.pdf - School of Computing Science


(1)

Query-Driven Learning for Next Generation Predictive Modelling & Analytics

Fotis Savva

University of Glasgow, School of Computing Science

Essence: Pervasive & Distributed Intelligence

f.savva.1@research.gla.ac.uk

02/07/2019

http://www.dcs.gla.ac.uk/essence/


(2)

Exponential Data = Exponential Cost

Data obtained from: https://cloud.google.com/bigquery/pricing

[Diagram: Queries → Data Store → Answers]

Computational Cost

Increased waiting times affect productivity. Interactivity constraint: 500 ms – 2 s [1]

Monetary Cost

(3)

Approximate Query Processing (AQP)

• Provides approximate answers in a fraction of the time

[Diagram: Queries → SAQP → Approximate Answers; SAQP sends Forwarded Queries to the Data Store and receives Answers]

Observations:

• Makes use of samples, which still require storage

• Trade-off between accuracy and sampling ratio

• Can break the interactivity constraint

• Makes use of the same infrastructure
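
The accuracy versus sampling-ratio trade-off noted above can be illustrated with a minimal sketch of sampling-based approximate query processing. It assumes a hypothetical single-column table held in memory and a uniform random sample; the function name `approximate_count` and all parameters are illustrative and do not come from the slides or from any particular AQP engine.

```python
import random

def approximate_count(data, predicate, sampling_ratio, seed=0):
    """Estimate COUNT(*) WHERE predicate using a uniform random sample."""
    rng = random.Random(seed)
    sample_size = max(1, int(len(data) * sampling_ratio))
    # The sample itself must be stored or drawn: this is the storage cost.
    sample = rng.sample(data, sample_size)
    matches = sum(1 for row in sample if predicate(row))
    # Scale the sample count up to the size of the full table.
    return matches * len(data) / sample_size

data = list(range(100_000))  # hypothetical table with one numeric column
exact = sum(1 for v in data if v < 50_000)
estimate = approximate_count(data, lambda v: v < 50_000, sampling_ratio=0.1)
relative_error = abs(estimate - exact) / exact
```

A smaller `sampling_ratio` reads fewer rows but generally gives a noisier estimate, which is the trade-off listed in the observations.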

(4)

Query-Driven Learning (QDL)

• Use past queries and train Machine Learning models to predict answers

[Diagram: Queries → 𝑓 → Predicted Answers]

Objectives :

• Make NO use of data (Save Money)

• Accurate

• Efficient (Save Time)

• Lightweight

• Data Store Agnostic
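
As a rough illustration of the idea, the sketch below trains a model on past queries and their answers, then answers a new query without reading the data. The slides do not say which model 𝑓 is, so everything here is an assumption: the `(low, high)` encoding of a range COUNT query, the k-nearest-neighbour regressor, the class name `QueryDrivenModel`, and the synthetic data store.

```python
import math
import random

def exact_count(data, low, high):
    """Answer the data store would return for a range COUNT query."""
    return sum(1 for v in data if low <= v <= high)

class QueryDrivenModel:
    """k-nearest-neighbour regressor over query parameters (illustrative stand-in for f)."""

    def __init__(self, k=3):
        self.k = k
        self.history = []  # past (query_vector, answer) pairs

    def fit(self, queries, answers):
        self.history = list(zip(queries, answers))

    def predict(self, query):
        # Average the answers of the k most similar past queries.
        nearest = sorted(self.history, key=lambda qa: math.dist(qa[0], query))[: self.k]
        return sum(answer for _, answer in nearest) / len(nearest)

rng = random.Random(0)
data = [rng.uniform(0, 100) for _ in range(10_000)]  # hypothetical data store

# Past queries: (low, high) range predicates and the answers they received.
past_queries = []
for _ in range(500):
    low = rng.uniform(0, 80)
    past_queries.append((low, low + rng.uniform(5, 20)))
past_answers = [exact_count(data, lo, hi) for lo, hi in past_queries]

model = QueryDrivenModel(k=3)
model.fit(past_queries, past_answers)

# A new query is answered from the model alone, without touching `data`.
new_query = (40.0, 55.0)
predicted = model.predict(new_query)
actual = exact_count(data, *new_query)  # computed only to measure the error
relative_error = abs(predicted - actual) / actual
```

Only the query log is needed at prediction time, which is what makes such a model lightweight and independent of the underlying data store.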

Relative Error: 3%

0.0001 seconds

[Diagram labels: 𝑓, Data Store]

(5)

Analytics Stack

[Diagram labels: SAQP, Data Store, 𝑓, Cache, RAM, Disk; axes: Accuracy, Speed]

(6)

System Overview

(7)

Possible Deployment

BigQuery Pricing:

On-demand: $5 per TB

Amazon Redshift:

On-demand: $0.25 per hour (cheapest)

Spectrum: $5 per TB

Both also offer flat-rate options.

Analytics using QDL and cloud functions (Function as a Service, FaaS)

Google Cloud Functions: $0.40 per 1 million function calls (= queries)

AWS Lambda: $0.20 per 1 million function calls
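
To make the price comparison concrete, here is a small worked calculation using only the per-TB and per-call prices quoted on this slide. The workload is hypothetical (one million queries, each scanning 10 GB), and charges not listed on the slide, such as function compute time or free tiers, are ignored.

```python
# Prices quoted on the slide (USD).
BIGQUERY_PER_TB = 5.00           # on-demand, per TB scanned
GCF_PER_MILLION_CALLS = 0.40     # Google Cloud Functions
LAMBDA_PER_MILLION_CALLS = 0.20  # AWS Lambda

def scan_cost(num_queries, tb_scanned_per_query, price_per_tb=BIGQUERY_PER_TB):
    """Cost when every query is billed by the volume of data it scans."""
    return num_queries * tb_scanned_per_query * price_per_tb

def faas_cost(num_queries, price_per_million):
    """Cost when every query is one function call to a deployed model."""
    return num_queries / 1_000_000 * price_per_million

num_queries = 1_000_000
tb_per_query = 0.01  # hypothetical: each query scans 10 GB

bigquery_total = scan_cost(num_queries, tb_per_query)
gcf_total = faas_cost(num_queries, GCF_PER_MILLION_CALLS)
lambda_total = faas_cost(num_queries, LAMBDA_PER_MILLION_CALLS)
```

Under these assumptions the scan-based bill grows with the data volume touched per query, while the per-call bill depends only on the number of queries.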

(8)

Thank you for your attention

