• Nenhum resultado encontrado

Civil identification with DNA profiles databases using Bayesian networks

N/A
N/A
Protected

Academic year: 2021

Share "Civil identification with DNA profiles databases using Bayesian networks"

Copied!
6
0
0

Texto

(1)

Repositório ISCTE-IUL

Deposited in Repositório ISCTE-IUL:

2018-06-12

Deposited version:

Post-print

Peer-review status of attached file:

Peer-reviewed

Citation for published item:

Andrade, M. & Ferreira, M. A. M. (2017). Civil identification with DNA profiles databases using Bayesian networks. Acta Scientiae et Intellectus. 3 (4), 32-35

Further information on publisher's website:

http://www.actaint.com/

Publisher's copyright statement:

This is the peer reviewed version of the following article: Andrade, M. & Ferreira, M. A. M. (2017). Civil identification with DNA profiles databases using Bayesian networks. Acta Scientiae et Intellectus. 3 (4), 32-35. This article may be used for non-commercial purposes in accordance with the

Publisher's Terms and Conditions for self-archiving.

Use policy

Creative Commons CC BY 4.0

The full-text may be used and/or reproduced, and given to third parties in any format or medium, without prior permission or charge, for personal research or study, educational, or not-for-profit purposes provided that:

• a full bibliographic reference is made to the original source • a link is made to the metadata record in the Repository • the full-text is not changed in any way

The full-text must not be sold in any format or medium without the formal permission of the copyright holders.

Serviços de Informação e Documentação, Instituto Universitário de Lisboa (ISCTE-IUL) Av. das Forças Armadas, Edifício II, 1649-026 Lisboa Portugal

Phone: +(351) 217 903 024 | e-mail: administrador.repositorio@iscte-iul.pt https://repositorio.iscte-iul.pt

(2)

Civil Identification with DNA Profiles Databases using Bayesian Networks

Marina Andrade

Instituto Universitário de Lisboa (ISCTE-IUL), Information Sciences, Technologies and Architecture Research Center (ISTAR-IUL) and Business Research Unit (BRU-IUL), PORTUGAL

Manuel Alberto M. Ferreira

Instituto Universitário de Lisboa (ISCTE-IUL), Business Research Unit (BRU-IUL) and Information Sciences, Technologies and Architecture Research Center (ISTAR-IUL), PORTUGAL

E-mails : marina.andrade@iscte.pt

manuel.ferreira@iscte.pt

ABSTRACT

In forensic identification problems it is common the use of DNA profiles. The first bases of DNA profiling data emerged in England in 1995 having raised new challenges in the context of forensic identification. In Portugal the legislation that allowed the creation of a DNA profile database was set in 2008. In this work it is intended to discuss how to properly use the databases in the recognition of bodies problem that compiles information about missing individuals which are known to belong to one or more families whose members have reported the disappearance of relatives.

Key words DNA profiles, Identification problems, Bayesian networks. INTRODUCTION

The use of networks that "carry" probabilities began with the geneticist Sewall Wright in 1921. Since then its use coated in different ways in various fields such as social sciences and economics. Here the models used are generally linear being examples the "Path Diagrams" or "Structural Equation Models (SEM)". In artificial intelligence are usually used nonlinear models called "Bayesian networks" also called "Probabilistic Expert Systems (PES)".

In this work the aim is to approach civil identification problems: the recognition of bodies which compiles information about missing individuals who are known to belong to one or more families whose members have reported the disappearance of family members using the DNA database and information about the genetic profile of the bodies found. In the processing of data we resort to "object-oriented" Bayesian networks, which are an example of PES, using the Hugin

(3)

software.

Portuguese law 5/2008 establishes the principles for creating and maintaining a database of DNA profiles for identification purposes. And regulates the collection, processing and storage of human cell samples, their collection and analysis of DNA profiling, the method for comparing DNA profiles resulting samples, and processing and storage of information in a computer file.

It is assumed that the database consists of a file containing samples delinquent information doomed to 3 or more years in prison - ;a file containing information volunteers samples -; a file containing information about "problem samples" or "reference samples" of bodies or parts of bodies or objects or places where authorities collected samples -  .

Unfortunately, the project to create a DNA profiles database for identification purposes has been abandoned by the Portuguese Government.

CIVIL IDENTIFICATION USING DNA PROFILIES DATABASES-ONE MISSING AND A VOLUNTEER

It is reported the disappearance of an individual. A body is found. The hypotheses are: HA: The body found is that of the missing individual X vs HD: The body found is any other individual than that of X. In general is not admitted the death of relatives and rejects the hypothesis that establishes the loss of the missing relative. A volunteer provides genetic material to be used to test a partial match. The evidence is -E

CBf,Cvol

- the genetic trait of the body found,CBF, and

the volunteer, Cvol. The odds a posteriori (ratio of a posteriori probability) is

 

          , | , | , , | , , | , , | , , |        vol H P vol H P vol H E P vol H E P vol E H P vol E H P D A D A D A .

Assuming thatP

HA|vol,

 

PHD|vol,

, an usual proceeding,

. , , | , , | , , , | , , , , | , , | , , | , , | , , , | , , , | , , | 1                                                           vol H C P vol H C P vol H C C P vol H C C P vol E H P vol E H P vol H C C P vol H C C P vol E H P vol E H P D vol A vol D vol BF A vol BF D A ratio likelihood D vol BF A vol BF D A

(4)

Genetic voluntary feature is not in the conditioning events. Whether voluntary or not related to the individual whose body was found it does not provide information to the uncertainty about its genotype.

Depending on the volunteer and his family relationship with the missing person just can observe a partial match with a volunteer. Furthermore it is important to check for a match between

BF

C and some of the "sample problem" in  . Assuming no coincidence of CBF and any  sample, the likelihood ratio may be written as:

. | , | D BF A vol BF H C P H C C P

If only one volunteer is considered the likelihood ratio can be calculated using a Bayesian network. Suppose you have a missing person, an old man or woman, and a son or daughter who announces the disappearance and voluntarily provides its own genetic profile. The likelihood ratio can be calculated using the following chain:

Figure 1: Network for the civil identification with one volunteer, son or daughter

For a case with a volunteer, but now for example a brother or sister of a missing person who is being sought, the likelihood ratio can be calculated using the following Bayesian network:

(5)

Figure 2: Network for the civil identification with one volunteer, brother or sister

The odds a posteriori is, as it was seen, the likelihood ratio, which allows for benchmarking between HA and HD.

In civil identification problem, the identification evidence may involve more than one individual. It is a case already considered, see for example (Andrade and Ferreira, 2010a).

REFERENCES

1) D. Balding (2002).The DNA Database Controversy. Biometrics, 58 (1), 241-244.

2) F. Corte-Real (2004). Forensic DNA Databases. Forensic Science International,

S143-S144.

3) M. Andrade, M. A. M. Ferreira (2007). Analysis of a DNA mixture sample using object-o

riented Bayesian networks. 6th International Conference APLIMAT 2007; Bratislava; Slovakia,295-305

4) M. Andrade, M. A. M. Ferreira (2009). Criminal and Civil Identification with DNA

Databases Using Bayesian Networks. International Journal of Security, 3 (4), 65-74.

5) M. Andrade, M. A. M. Ferreira (2010). Evaluation of paternities with less usual data using

Bayesian networks. Proceedings - 2010 3rd International Conference on Biomedical Engineering and Informatics, BMEI 2010, Yantai, China. Volume 6, 2010, Article number 5639678, Pages 2475-2477. DOI: 10.1109/BMEI.2010.5639678

6) M. Andrade, M. A.M. Ferreira (2010a). Civil Identification Problems with Bayesian

Networks Using Official DNA Databases. APLIMAT – Journal of Applied Mathematics, 3 (3), 155-162.

7) M. Andrade, M. A.M. Ferreira (2013). Simple Maternity Search with Bayesian Networks.

12th Conference on Applied Mathematics, APLIMAT 2013, Proceedings. Bratislava, Slovakia.

(6)

problems. Applied Mathematical Sciences, 8 (137-140), 6963-6967.

DOI:10.12988/ams.2014.48617

Imagem

Figure 1: Network for the civil identification with one volunteer, son or daughter   For a case with a volunteer, but now for example a brother or sister of a missing person  who is being sought, the likelihood ratio can be calculated using the following B
Figure 2: Network for the civil identification with one volunteer, brother or sister

Referências

Documentos relacionados

O presente artigo apresenta como o jogo de mobilização social e transformação comunitária criado pelo Instituto Elos, jogo oasis, contribuiu para as condições de

A empresa vencedora poderá retirar o instrumento equivalente (Nota de Empenho) na Reitoria do IFRJ à Rua Pereira de Almeida, 88 - Praça da Bandeira - Rio de Janeiro/RJ. A

Hoje, mais do que nunca, as FSS trabalham de modo a antecipar a consolidação dos factos que possam pôr em causa a ordem, a tranquilidade e a segurança pública (Manuel

Existe ainda a possibilidade de o protocolo trabalhar com o padrão mais simples, de 64 bits onde a chave pode ser de 40 ou 24 bits, portanto o padrão de cifragem dos dados é

Nas páginas subsequentes será apresentada uma análise estatística não só sobre as características dos indivíduos sub e sobre-educados, como também sobre

Uma estratégia de preço baseada no valor no CE deverá refletir os atributos que são dife- renciáveis para uma lucratividade no longo pra- zo. Desta forma, atingir o segmento alvo

(DILTHEY, 1951, p.367) Se a delimitação é impossível pela infinidade das vivências, suas expressões e objetivações são delimitáveis pelo tipo, ou seja, pela uniformidade

O pensamento, enquanto modus operandis na busca de soluções do social-histórico, ou seja, o caminho para conhecer, deve ser empreendido como força mental que exige a