• Nenhum resultado encontrado

Keyword Search over COVID-19 Data Yenier T. Izquierdo

N/A
N/A
Protected

Academic year: 2022

Share "Keyword Search over COVID-19 Data Yenier T. Izquierdo"

Copied!
25
0
0

Texto

(1)

Keyword Search over COVID-19 Data

Yenier T. Izquierdo1,2, Gre1el M. García2, Melissa Lemos1,2, Alexandre Novello1, Bruno Novelli2, Cleber Damasceno2,

Luiz André P.P. Leme3, Marco A. Casanova1,2

1Department of Informa-cs, PUC-Rio, Rio de Janeiro, RJ – Brazil

(2)

2

Agenda

• Mo#va#on

• DANKE

CovidKeys

• Conclusions

(3)

Mo)va)on

• Examples of COVID-19 data

FAPESP / USP / Ins,tuto Fleury, H. Sírio-Libanês and H. Israelita Albert Einstein

The COVID-19 Data Sharing / BR ini7a7ve

Ministry of Health/Brazil

NSG (No#ficações de Síndrome Gripal)

SRAG 2020 - Acute Respiratory Syndrome (SARS) and COVID-19 data

The Center for Systems Science and Engineering (CSSE) at Johns Hopkins University

COVID-19 Data Repository

Our World in Data – Global Change Data Lab / Oxford Mar,n School

Coronavirus pandemic: daily updated research and data

The Google Cloud services

(4)

Mo)va)on

• COVID-19 data

• Data is typically available...

as downloadable CSV files

as datasets that can be queried through a user interface (e.g., Elas,c Search)

as datasets available through a "sandbox" (e.g., Google Cloud services)

• Files can be quite large, with hundreds of columns and millions of lines

• Data can be at different levels of granularity

4

(5)

Mo)va)on

• COVID-19 data

• Cumbersome to process with the usual desktop spreadsheet tools

• A perhaps more robust approach:

Download and store the data in a standard DBMS

Define a query that retrieves the data the user is interested in

Export the query results to a data analysis tool

(6)

Mo)va)on

• COVID-19 data

• Cumbersome to process with the usual desktop spreadsheet tools

• A perhaps more robust approach:

Download and store the data in a standard DBMS

Define a query that retrieves the data the user is interested in

Export the query results to a data analysis tool

6

Keyword search

(7)

Motivation

• Keyword search over databases

• the user specifies a few terms, called keywords

A keyword query is simply a list of keywords

• the system must retrieve the data that best match the list of keywords

Keyword search has been expanded to rela,onal databases (and RDF datasets)

Keyword queries are typically compiled into SQL (or SPARQL)

(8)

8

Agenda

• Mo#va#on

• DANKE

CovidKeys

• Conclusions

(9)

DANKE

• What is DANKE?

• a platform for data and knowledge retrieval

• Main Components

• Data Ingestion and Indexing Services

• Keyword Query Processing Service

Compiles a keyword query into an SQL (or SPARQL) query

Uses the database schema to guide the process

• ...

(10)

10

name P1

name

P2

“Richard Burton”

“Liz Taylor”

name

“John Hurt”

P3 Query: Taylor Burton Hurt

(11)

name P1

name

P2 rel

win

A3

descr

name win

P3 Query: Taylor Burton Hurt

"Valladolid Best Actor

(1984)"

(12)

12

name A1

P1

M1 act

title

name descr

P2

“Richard Burton”

act A2

win descr

rel

“Liz Taylor”

“Oscar Best Actress”

"Oscar Best Cinematography"

"Who's Afraid of Virginia Woolf"

win

A3

descr

name win

“John Hurt”

P3 Query: Taylor Burton Hurt

award

?v1 name “Taylor”

?v2 name “Burton”

?v3 name “Hurt”

?v1 rel ?v2

?v2 win ?v4

?v3 win ?v4

"Valladolid Best Actor

(1984)"

(13)

...

Taylor Burton Hurt

(14)

14

Agenda

• Mo#va#on

• DANKE

CovidKeys

• Conclusions

(15)

Experiments with CovidKeys

CovidKeyS

• Web applica-on that offers keyword search over COVID-19 data

• Built on top of DANKE

• Three proof-of-concept scenarios

No,ficações de Síndrome Gripal - NSG

Google COVID-19 Public Datasets

Google Mobility

John Hopkins University

World Bank Global

FAPESP COVID-19 Data Sharing/BR

(16)

Experiments with CovidKeys

• Scenario 3 – FAPESP COVID-19 Data Sharing/BR Dataset (July 2020)

25

Database with two tables and

a 1-to-N rela3onship

(17)
(18)

29

(19)
(20)
(21)

Agenda

• Mo#va#on

• DANKE

• Architecture

• Keyword Query Processing

• Experiments with CovidKeys

• Conclusions

(22)

Conclusions

• Summary

• DANKE - a plaZorm for data and knowledge retrieval

CovidKeyS

uses DANKE to implement keyword queries over three COVID-19 data scenarios

The NSG (No7ficações de Síndrome Gripal) dataset

Global Data (from the Google COVID-19 Public Datasets)

The FAPESP COVID-19 Data Sharing / BR ini3a3ve

33

(23)

Conclusions

• What else?

• Extend DANKE to...

Operate over federated databases

Process change sets

• Extend keyword query processing to support...

Spa,al operators (e.g., near this point)

Temporal operators (e.g., same date)

Aggrega,on opera,ons

• ... beyond keyword query

(24)

35

Thank You

(25)

References for this presenta/on...

casanova puc-rio

Referências

Documentos relacionados

O tratamento serve para tratar os sintomas de síndrome gripal, tanto para covid-19 quanto para gripe. Caso os testes de covid-19 não estejam amplamente disponíveis, confira

O CRM contempla processos de cunho operacional, como o registro de uma reclamação de um cliente até análises aprofundadas sobre produtos que são comumente

Assim, tendo o ceticismo como mediador, a relação direta entre intenção de compra e compra declarada não é afetada e isso pode demonstrar que o mesmo pode

Relacionado a isso, pode-se dizer que, de certa forma, as ações colaborativas são uma espécie de protótipo, afinal de acordo com o texto “Cómohacerun

teórico a empregá-la em seu trabalho sobre o verso russo em estudo com- parativo com o verso tcheco (1923). Mas, afinal, se se está falando tanto em estrutura, o que eles entendiam

Preferência por modelo é qualitativa nominal (seria apenas o modelo de automóvel), Renda mensal pode ser considerada quantitativa contínua, especialmente se for medida em salários

As atividades são extensivas e aplicáveis também aos pacientes que procuram por atendimento na unidade com síndrome gripal e em casos confirmados da

Buscador atualizado com os produtos dos fornecedores sócios da FYVAR (Mediante Api European Sourcing ou através da FYVAR)..