Contributions to the Automatic Verification of JavaCard Applets


Universidade Federal do Rio Grande do Norte
Centro de Ciências Exatas e da Terra
Departamento de Informática e Matemática Aplicada
Programa de Pós-Graduação em Sistemas e Computação

CONTRIBUTIONS TO THE AUTOMATIC VERIFICATION OF JAVACARD APPLETS

Antonio Augusto Oliveira Viana da Silva

MASTER'S DISSERTATION

Natal - RN


Dissertation presented to the Programa de Pós-Graduação em Sistemas e Computação of the Departamento de Informática e Matemática Aplicada, Universidade Federal do Rio Grande do Norte, in partial fulfillment of the requirements for the degree of Master in Computer Science.

Advisor: David Paul Boris Déharbe

Natal - RN


First of all, I would like to thank all the gods and goddesses of our world for always guiding us and showing us the path we must follow. Even when these paths seem so difficult and full of problems, we learn that we must always persist because, after all, "things are the way they are because that is how they had to be".

I also thank my parents, Antonio Viana da Silva and Maria de Lourdes Oliveira Viana, not only for giving me the gift of life, but also for taking care of me and helping me walk, especially when I came across one of those difficult paths.

Thanks also to all my relatives who, directly or indirectly, were part of my upbringing. I especially thank my grandparents: the one I never met (Henrique Oliveira), those I had the pleasure of having by my side for some years (Esmeralda dos Santos Oliveira and Maria Euzebia da Silva), and the one I still live alongside today (Heleno Viana da Silva).

My advisor, David Déharbe, also deserves my sincere thanks for his patience and perseverance while he guided (and sometimes misguided) me during this year and a half, as I tried to understand a little of this curious world of formal verification.

Naturally, I also want to thank my friends ("the siblings God let us choose"). I thank both the friends I left in Aracaju when I came to Natal, who stayed there rooting for me and encouraging me, and all those I made here, who share with me all the sufferings, hopes and successes of this period. I will not try to name everyone, since I would certainly forget someone, but a special hug goes to my brothers at heart Rauflan dos Santos Brito, Roberto Barreto dos Anjos and Alisson Vitório de Lima (thanks a lot, guys!), and also to the "manguaça people" who, even though so scattered, all remain as "emanguaçados" as ever.

Finally, I would like to thank Ambev, since its products helped us relax in the most stressful moments of our journey; after all, nobody is made of iron!


SUMMARY

The rapid growth in the use of smart cards (by banks, transport companies, mobile phones, etc.) has raised an important issue that must be considered: the need for tools that can be used to verify the cards, so that the correctness of their software can be guaranteed.

Since the vast majority of cards developed today use JavaCard technology in their software layer, using the Java Modeling Language (JML) to specify the programs appears as a natural solution.

JML is a formal specification language tied to Java. It was inspired by the Larch and Eiffel methodologies, and has been widely adopted as the de facto language for specifying any Java-related program.

Several tools that make use of JML have already been developed, covering a wide range of functionality, including runtime and static checking. However, the tools currently available for static checking are not fully automated, and those that are do not offer an adequate level of completeness and soundness.

Our goal is to contribute a series of techniques that can be used to achieve fully automatic and reliable verification of JavaCard applets. In this work we present the first steps in that direction. Using a software platform composed of Krakatoa, Why and haRVey, we developed a set of techniques to reduce the size of the theory needed to verify the specifications.

These techniques gave very good results, with gains of almost 100% in all the tests we performed, and proved to be an approach that should always be considered, not only in this setting but in most real-world problems related to automatic verification.

Keywords: Formal Verification, Embedded Systems, Java, JML, Theorem Provers.

ABSTRACT

The widespread growth in the use of smart cards (by banks, transport services, cell phones, etc.) has raised an important issue that must be addressed: the need for tools that can be used to verify such cards, so as to guarantee the correctness of their software.

As the vast majority of cards being developed nowadays use JavaCard technology as their software layer, the use of the Java Modeling Language (JML) to specify their programs appears as a natural solution.

JML is a formal language tailored to Java. It has been inspired by methodologies from Larch and Eiffel, and has been widely adopted as the de facto language for the specification of any Java-related program.

Various tools that make use of JML have already been developed, covering a wide range of functionality, such as runtime and static checking. However, the tools available so far for static checking are not fully automated, and those that are do not offer an adequate level of soundness and completeness.

Our objective is to contribute a series of techniques that can be used to accomplish a fully automated and reliable verification of JavaCard applets. In this work we present the first steps in this direction. Using a software platform composed of Krakatoa, Why and haRVey, we developed a set of techniques to reduce the size of the theory necessary to verify the specifications.

These techniques have yielded very good results, with gains of almost 100% in all tested cases, and have proved to be valuable not only in this setting but in most real-world problems related to automatic verification.

Keywords: Formal Verification, Embedded Systems, Java, JML, Theorem Proving.

EXTENDED SUMMARY

0.1 INTRODUCTION

The use of formal methods for software verification is a subject that has been studied since the seminal works of Hoare. Nevertheless, only recently have such techniques begun to be used in day-to-day practice. The reason is that, until recently, these methods could only be used by specialists and added considerably to the cost of a project. Today, with advances in the field of formal methods, increased computing power and falling computer prices, the benefits of formal techniques finally outweigh the costs of using them.

Even with this "balance", however, not every area is using formal methods. Their use is currently restricted to projects where obtaining correct software is critical, such as nuclear power plants, air traffic control systems and embedded systems.

Among these industries, one of the fastest growing is embedded systems, in particular smart cards [78, 38]: small embedded systems (usually in the format of a credit card) that are widely used around the globe and that may eventually replace all the documents we carry today.

One of the main drivers of the wide dissemination of smart cards was JavaCard [66, 67, 68, 77, 79], which brings to this world all the benefits (and the programmer base) of Java technology.

For the formal specification and verification of such systems, the standard currently being adopted by the market is JML [51, 10]. The Java Modeling Language (JML) is a formal specification language associated with Java that lets users of the language write formal specifications that are easy to understand (since JML's syntax is based on Java). Several tools have already been developed to work with specifications written in JML, ranging from documentation generation to static verification.

However, the existing tools for static verification do not offer an ideal solution for the market, which needs a tool that is both automatic and reliable. This work presents the initial steps in the search for such a solution. We present techniques for reducing the theory of proof obligations generated from Java programs specified in JML. Although not directly tied to JavaCard systems, these reduction techniques are of fundamental importance for truly automatic verification. For example, in the tests we performed, using these techniques yielded a gain of up to 100% in the number of proof obligations verified, as well as a reduction in verification time for the obligations that could already be verified.


Our work is based on a tool platform composed of Krakatoa [56, 58, 57], Why [29, 27, 28] and haRVey [13, 22, 71, 20, 21]. The first two tools are used to generate proof obligations for various verification tools, given a Java program specified in JML; the last is a theorem prover that proved extremely efficient in earlier studies [22, 21, 13].

The remainder of this summary is organized as follows. Section 0.2 gives a brief history of smart cards, and of JavaCard technology in particular. Section 0.3 presents some basic concepts of the JML language. Section 0.4 presents the tools used in this work, and Section 0.5 describes the theory reduction techniques we developed, together with the results we obtained. The summary ends in Section 0.6 with our conclusions and future work.

0.2 A JAVA-BASED APPROACH TO SMART CARDS

Smart card technology allows a plastic card (usually the size of a credit card) with an embedded chip to be used for a wide variety of applications. The first research on smart card technology dates from 1968, when two German inventors patented their idea of using microchips in plastic cards, but it was not until 1976 that the industry reached the point where the technology could be developed at acceptable prices. In 1981 the first real trials of smart card applications took place, in Europe and in the United States, using the cards for banking, identification, payments, etc.

Today this technology is commonly used in many parts of the world, and in many different ways. Its applications range from building access and digital signing of applications to pay-TV access cards and SIM cards for mobile phones. One of the main goals of smart card developers is that, in the future, the technology can serve as the standard for storing all of its owner's information, such as identity card, driver's license, credit card, etc., efficiently (and securely) replacing all the cards we currently carry in our wallets.


The solution to this problem began to appear in 1996, when Sun released the first version of the JavaCard specification. JavaCard is a reduced version of the Java language, adapted to the limited resources of smart cards, which aimed to bring to this platform the benefits Java already offered to desktop applications (object orientation in software development, interoperability and security). With this new platform, application development became simpler: once a developer had a card implementing a JavaCard virtual machine (JCVM), any application built with this technology would be immediately available on it.

Thanks to these advantages, JavaCard quickly became the standard for developing smart card applications, and is already used in SIM cards, credit cards and prepaid cards, to name a few examples. Even with so many advantages, however, JavaCard applications do not by themselves offer the means needed to verify that a program really follows a given specification. For this kind of verification the language adopted was JML, which is the de facto standard for the formal specification and verification of Java applications.

0.3 THE JML LANGUAGE

JML is a Behavioral Interface Specification Language (BISL) associated with Java. In JML, the interface of a function or class is defined by the Java language itself (function names, parameters, etc.) and its behavior is given in JML. While JML's syntax is closely tied to Java's, its semantics come from languages such as Eiffel [61, 60] and Larch [85, 35, 34, 48, 49], and it also uses aspects of the refinement calculus [9, 64, 63].

The main goal of the JML developers was to design a language that:

• Made it easy to document existing Java programs;

• Was easy to understand for Java programmers, including those with a modest mathematical background;

• Could be supported by tools, thanks to a formal semantics.

JML specifications are usually written alongside the application code, using a special kind of comment: /*@ and @*/ for multi-line specifications, or //@ for single-line specifications. JML has been in use for some time and many tools already support it. For static verification we can mention ESC/Java [31], LOOP [43] and Krakatoa [58], among others. Besides these verifiers, there are also several applications covering test generators, documentation utilities, invariant generators, dynamic checkers, and others [7].

The conditions can be written using Java expressions that are pure (that is, free of side effects). The expressions can also use some constructs provided by JML itself, such as \forall and \exists, which are the universal and existential quantifiers respectively, among others.

0.3.1 Pre- and Postconditions

In JML, functions can be annotated with pre- and postconditions using the requires and ensures clauses. In postconditions, the \result and \old constructs can be used to refer to the method's result and to the value of an expression before execution, respectively.

0.3.2 Invariants

Class invariants can be specified in JML using the invariant clause. It is also possible to define constraints, which relate the value of a variable before and after the execution of a method, using the constraint clause.

0.3.3 Exception Handling

Exceptions can be handled in JML using the exceptional_behavior clause, which lets us define a new precondition required for an exception to occur, as well as a postcondition stating what the state of the system will be when it occurs, and the signals clause, which specifies which exception is being handled.

0.3.4 Inheritance

JML fully supports inheritance through the also keyword, used when specifying pre- and postconditions for a method in order to describe additional conditions relative to a superclass.
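JML annotations are written in Java comments and checked by JML tools. As a rough illustration of what requires, ensures, invariant, \old and \result mean when read as runtime checks, the following Python sketch mimics them with assertions. It is only an analogy: the Purse class, its debit method and the specific conditions are hypothetical and are not taken from the Krakatoa examples.

```python
class Purse:
    """Toy purse; the class and its contract are illustrative assumptions."""

    def __init__(self, balance: int) -> None:
        self.balance = balance
        self._check_invariant()

    def _check_invariant(self) -> None:
        # Plays the role of a JML class invariant: balance >= 0.
        assert self.balance >= 0, "invariant violated"

    def debit(self, amount: int) -> int:
        # Plays the role of: requires 0 < amount && amount <= balance;
        assert 0 < amount <= self.balance, "precondition violated"
        old_balance = self.balance  # stands in for \old(balance)

        self.balance -= amount
        result = self.balance       # stands in for \result

        # Plays the role of:
        #   ensures balance == \old(balance) - amount && \result == balance;
        assert self.balance == old_balance - amount, "postcondition violated"
        assert result == self.balance, "postcondition violated"
        self._check_invariant()
        return result
```

A static verifier such as the platform described next aims to prove that these checks can never fail, instead of testing them on each call.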

0.4 A PLATFORM FOR THE VERIFICATION OF JAVA PROGRAMS

To successfully verify our Java programs (and JavaCard programs in particular), we rely on a software platform composed of three applications: Why, Krakatoa and haRVey. Each of these tools is presented below.

0.4.1 Why

Why is a multi-language, multi-prover tool that, given an annotated program as input, can generate proof obligations for several tools. The idea behind this approach is that developing a good verification tool requires considerable effort and should be left to specialists, whereas generating obligations for those provers should be as easy as possible.

[...] HOL Light and the theorem provers Simplify and haRVey. According to the developers, additional output formats can be added quickly.

To achieve this flexibility, Why works internally with an ML-based language (called WL). When Why receives an annotated program as input, it first translates the program into its internal language and generates proof obligations for it (based on the given annotations). Why then uses these "generic" obligations to generate the proof obligations in the specific language of the chosen prover.

A diagram of this process is shown in Figure 0.1.

Figure 0.1. Diagram of the Why workflow.

0.4.2 Krakatoa

Krakatoa works together with Why to give it support for the Java language. It is responsible for transforming Java/JML programs and annotations into proof obligations. Krakatoa's operation is closely tied to Why's, and it thereby inherits the ability to generate obligations for several provers.

When verifying an annotated program, Krakatoa performs the following steps [58]:

i) Remove the "syntactic sugar" from each JML specification, so that in the end every method has exactly one requires clause, one assignable clause, one ensures clause and one signals clause, and class invariants are contained in a single invariant clause;

ii) Compute, for each method, when it reads or writes the stack, or reads or writes any static variable in the program;

iii) Generate specifications in Why's language for the program (one per class). Each generated file contains global variable declarations, method declarations, external Why functions and predicate declarations for each pure method;

v) Generate Why files for each method to be verified, containing programs translated from the Java code and the JML specifications directly into Why.

Krakatoa's work ends at this point. Once these steps are complete, we use Why to generate the proof obligations and the prover to verify them. Figure 0.2 illustrates this scheme.

Figure 0.2. Krakatoa workflow.

0.4.3 haRVey

haRVey is the tool that implements the light-weight theorem proving technique [20, 22]. This technique combines Binary Decision Diagrams (BDDs) [6, 82, 5] or SAT solvers with the superposition calculus [65] (here implemented by the E prover [74, 75, 76]) to achieve better results in the automatic verification of problems.

haRVey can verify problems of the form τ |= φ, where τ is a finite set of axioms ax_1, ax_2, ..., ax_n. Here φ and each ax_i are formulas of predicate logic with quantifiers. To show that the formula φ is valid with respect to a theory τ, the theorem prover uses a proof by refutation, showing that ¬φ is unsatisfiable with respect to the theory τ.
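The refutation idea can be illustrated on a propositional toy problem. The sketch below is not haRVey's procedure (which handles quantified first-order formulas by combining BDDs or SAT solvers with superposition); it simply enumerates truth assignments and reports that τ |= φ holds when no assignment satisfies every axiom together with ¬φ. The function names and the example over p, q and r are assumptions made for illustration.

```python
from itertools import product


def entails(axioms, goal, variables):
    """Return True when axioms |= goal, i.e. axioms plus not(goal) has no model."""
    for values in product([False, True], repeat=len(variables)):
        env = dict(zip(variables, values))
        # A model of all axioms in which the goal fails refutes the entailment.
        if all(ax(env) for ax in axioms) and not goal(env):
            return False
    return True  # the axioms together with the negated goal are unsatisfiable


def implies(p, q):
    return (not p) or q


# Theory: p -> q, q -> r.  Goal: p -> r.
axioms = [
    lambda e: implies(e["p"], e["q"]),
    lambda e: implies(e["q"], e["r"]),
]
goal = lambda e: implies(e["p"], e["r"])
```

With both axioms the goal is entailed; dropping the second axiom leaves a counter-model (p and q true, r false), so the entailment no longer holds.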


The syntax of haRVey input files is quite simple and based on the LISP language. A file consists of two sections, delimited by parentheses. The first section is a (possibly empty) set of formulas that axiomatize the problem at hand, and is called the theory. The second part of the file is the formula we want to prove, and is called the goal. Figure 0.3 shows a simple problem written in haRVey syntax.

(
  (forall A I E (= (read (write A I E) I) E))
  (forall A I J E (-> (not (= I J)) (= (read (write A I E) J) (read A J))))
)
(->
  (= (read r a) (read r b))
  (or
    (and (= (read r x) (read r a)) (= (read r x) (read r b)))
    (and (not (= (read r x) (read r a)))
         (= (read r x) (read r x)))
  )
)

Figure 0.3. An example haRVey file.

0.5 TECHNIQUES FOR THEORY SIMPLIFICATION

This section presents the techniques we developed, and the results obtained, in our study of theory reduction for problems arising from the verification of Java programs.

0.5.1 Purely symbolic reduction

The purely symbolic technique we developed is based on the idea that, given a problem a = b, b = c, d = e |= a = c, we can prove it using only the axioms a = b and b = c, without needing the third axiom. Our initial idea was that the axioms needed to verify a problem would be only those containing symbols that belong to the goal's set of symbols. However, this does not hold for problems such as a = b, d = c, b = d |= a = c, where the relation between the axioms and the symbols is indirect.

To work around this problem we reformulated our concept and decided to keep track of the relationships between axioms. To do so, we partition the theory into sets of related axioms. Two axioms Ax_1 and Ax_2 [...]

ii) there exists an axiom Ax_3 such that Ax_1 ρ Ax_3 ∧ Ax_3 ρ Ax_2.

Additionally, ρ-Ax is defined as the ρ-class of all axioms related to Ax.

Now consider a symbol s and a set of axioms associated with s (denoted χ_s). If s appears in axiom Ax_i, then the new value of χ_s is χ_s ∪ ρ-Ax_i; otherwise χ_s is left unchanged. After all axioms of the theory have been processed, the semantics of a symbol s is defined by the set χ_s of axioms.

Thus, if we have to show that a formula φ is valid with respect to a given theory τ, it suffices to consider the union of the ρ-classes of axioms in χ_s, for every symbol s appearing in φ.
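A minimal sketch of this selection is given below. Since the first condition in the definition of ρ is not available in this text, the sketch assumes that two axioms are directly related when they share at least one non-logical symbol, and takes ρ-classes to be the transitive closure of that relation. Axioms are represented only by their symbol sets; the function name and this representation are illustrative assumptions, not haRVey's implementation.

```python
def relevant_axioms(axioms, goal_symbols):
    """Select the axioms whose rho-class mentions some goal symbol.

    axioms: dict mapping an axiom label to the set of non-logical
    symbols that occur in it (an assumed representation).
    """
    parent = {name: name for name in axioms}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # Assumed direct relation: two axioms share at least one symbol.
    first_seen = {}
    for name, symbols in axioms.items():
        for s in symbols:
            if s in first_seen:
                parent[find(name)] = find(first_seen[s])
            else:
                first_seen[s] = name

    # chi_s collects the classes of the axioms in which s occurs;
    # the result is the union of chi_s over the goal symbols.
    goal_symbols = set(goal_symbols)
    wanted = {find(n) for n, symbols in axioms.items() if symbols & goal_symbols}
    return {n for n in axioms if find(n) in wanted}


direct = {"a=b": {"a", "b"}, "b=c": {"b", "c"}, "d=e": {"d", "e"}}
indirect = {"a=b": {"a", "b"}, "d=c": {"d", "c"}, "b=d": {"b", "d"}}
```

On the two problems discussed in this section with goal a = c, the sketch keeps a = b and b = c and drops d = e in the first case, and keeps all three axioms in the second, where the connection between a and c runs through b = d.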

0.5.2 Quantifier elimination

The idea for the quantifier elimination technique came from the fact that some experiments with haRVey showed that it did not give satisfactory results when working with a large number of equivalences together with universal quantifiers.

Our goal with this technique was therefore to reduce as much as possible the number of equivalences occurring in a given problem, by eliminating them and replacing every occurrence of their left-hand side with the equivalent value on the right-hand side.

Although the idea is simple, its implementation ran into several problems, which led to restrictions on its use.

The first concern we had with this approach was the occurrence of cases such as:

a ↔ b
b ↔ c
c ↔ a

which would generate recursive (and infinite) substitutions. Fortunately, in our case this kind of problem does not occur, thanks to the algorithm we use to retrieve the substitution information (the union-find algorithm).
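The text names union-find but does not give its implementation, so the following is only a sketch of why a cycle such as a ↔ b, b ↔ c, c ↔ a is harmless under that algorithm: every atom is mapped to a single representative of its class, and replacing atoms by their representative is a one-step substitution that cannot loop. The function name and the string representation of atoms are assumptions.

```python
def representatives(equivalences):
    """Map every atom to one representative of its equivalence class."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for left, right in equivalences:
        root_left, root_right = find(left), find(right)
        if root_left != root_right:
            # Merge the two classes; an equivalence that closes a cycle
            # finds equal roots and changes nothing.
            parent[root_left] = root_right

    return {atom: find(atom) for atom in list(parent)}


cycle = [("a", "b"), ("b", "c"), ("c", "a")]
reps = representatives(cycle)
```

For the cycle above, all three atoms end up with the same representative, so each of them is rewritten exactly once.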

Another recursion-related problem is illustrated by the following expression: (forall U V (<-> (f U) (g V (f U)))). Here the left-hand-side term occurs on the other side of the equivalence, which would also generate recursive substitutions. In this case we decided simply to ignore such occurrences and keep the equivalences unchanged.

[...] for the moment we decided to leave this criterion as an option to be chosen by the user for each problem.

However, these restrictions were not enough to guarantee the completeness we wanted, so we decided to extend the restrictions on substitution. Now, in addition to the previous criteria, an equivalence must satisfy the following conjecture to be considered for substitution:

Conjecture 0.1 If an axiom is a universally quantified equivalence such that one of its sides is an atom in which all variables are universally quantified, we can eliminate this axiom and replace every instance of the atom with the other side of the equivalence, performing the appropriate variable substitutions.

Thus, given an equivalence (forall U V (<-> (f U V) (g a b))), we would replace every occurrence of f with (g a b). So an expression (and (f (h b) (h c)) (h d)) would become (and (g a b) (h d)).

Additionally, from this point on we also consider the right-hand side of an equivalence as a candidate for substitution when the left-hand side cannot be used.
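The replacement step described by Conjecture 0.1 can be sketched as a small term rewriter. The sketch assumes terms are nested Python tuples whose first element is the operator, with variables given as a set of names. The functions match, instantiate and eliminate are hypothetical names introduced here, and the code covers only the replacement itself, not the removal of the axiom or the eligibility checks discussed above.

```python
def match(pattern, term, variables, binding):
    """Try to extend `binding` so that `pattern` instantiated by it equals `term`."""
    if isinstance(pattern, str):
        if pattern in variables:
            if pattern in binding:
                return binding[pattern] == term
            binding[pattern] = term
            return True
        return pattern == term
    return (
        isinstance(term, tuple)
        and len(term) == len(pattern)
        and all(match(p, t, variables, binding) for p, t in zip(pattern, term))
    )


def instantiate(term, binding):
    """Apply the variable binding to a term."""
    if isinstance(term, str):
        return binding.get(term, term)
    return tuple(instantiate(t, binding) for t in term)


def eliminate(term, variables, atom, other_side):
    """Replace every instance of `atom` in `term` by the matching instance of `other_side`."""
    binding = {}
    if match(atom, term, variables, binding):
        return instantiate(other_side, binding)
    if isinstance(term, str):
        return term
    return tuple(eliminate(t, variables, atom, other_side) for t in term)


# (forall U V (<-> (f U V) (g a b))) applied to (and (f (h b) (h c)) (h d))
expr = ("and", ("f", ("h", "b"), ("h", "c")), ("h", "d"))
rewritten = eliminate(expr, {"U", "V"}, ("f", "U", "V"), ("g", "a", "b"))
```

On the example from the text the result is (and (g a b) (h d)); when the other side mentions the quantified variables, as in (forall U (<-> (f U) (g U))), the binding carries the matched arguments across.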

0.5.3 Theory structuring

This technique is based on the work of Reif and Schellhorn [72], which describes the steps needed to work with large theories using structured theories (they use the KIV tool, whose input language is similar to that of algebraic specifications).

Reif and Schellhorn divide this technique into two parts:

i) Elimination of unnecessary enrichments;

ii) Elimination of unnecessary axioms.

The second part is already performed by our purely symbolic approach (Section 0.5.1). The first part is based on the concept of persistent hierarchies: suppose a specification SPEC enriches MSPEC with a set of axioms ∆, and let φ be a formula. If φ contains only symbols present in MSPEC, then whenever φ is true in SPEC it is also true in MSPEC. We can therefore eliminate from our theory all the axioms ∆ added by SPEC. To implement this reduction strategy we implemented the algorithm given by Reif and Schellhorn, which is based on four criteria:

i) minimality criterion: find a minimal specification MSPEC that contains all the symbols present in sig(φ).

iii) specification criterion: in an elementary specification, the criterion above can also be used to determine whether an operation defined over that specification is hierarchically persistent;

iv) recursion criterion: the three previous criteria can be applied recursively until the set of axioms stabilizes.

Additionally, so that we could write hierarchically persistent specifications, we had to change the syntax of haRVey input files to a format resembling algebraic specifications. The new syntax is shown in Figure 0.4.

(
  [(theory [theory name]
     (extends [extended theory]*)
     (axioms [axiom]*)
   )]*
)
[goal formula]*

Figure 0.4. New haRVey syntax.
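As a rough sketch of the minimality criterion over such a structured theory, the code below looks for the theory with the fewest axioms (counting those of the theories it extends) whose signature covers every symbol of the goal. It is not Reif and Schellhorn's algorithm and covers none of the other criteria. The dictionary layout, the explicit "symbols" field (the haRVey syntax lists only a name, extended theories and axioms) and the theory names nat, list and array are all assumptions made for illustration.

```python
def minimal_theory(theories, goal_symbols):
    """Return (name, axioms) of the smallest theory covering the goal symbols.

    theories: name -> {"extends": [names], "symbols": set, "axioms": [labels]}
    Returns None when no theory covers every goal symbol.
    """

    def closure(name, seen=None):
        # All theories reachable through `extends`, including `name` itself.
        seen = set() if seen is None else seen
        if name not in seen:
            seen.add(name)
            for base in theories[name]["extends"]:
                closure(base, seen)
        return seen

    goal_symbols = set(goal_symbols)
    best = None
    for name in theories:
        members = closure(name)
        signature = set().union(*(theories[m]["symbols"] for m in members))
        axioms = [ax for m in sorted(members) for ax in theories[m]["axioms"]]
        if goal_symbols <= signature and (best is None or len(axioms) < len(best[1])):
            best = (name, axioms)
    return best


# Hypothetical hierarchy: list and array both enrich nat.
theories = {
    "nat": {"extends": [], "symbols": {"zero", "succ"}, "axioms": ["n1", "n2"]},
    "list": {"extends": ["nat"], "symbols": {"nil", "cons", "length"},
             "axioms": ["l1", "l2", "l3"]},
    "array": {"extends": ["nat"], "symbols": {"read", "write"},
              "axioms": ["a1", "a2"]},
}
```

A goal that mentions only zero and succ needs just the two axioms of nat; a goal that mentions read and succ needs array together with nat, and the list axioms are left out in both cases.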

0.5.4 Experimental results

To test the real impact of these techniques on the verification of Java applications, we used as a basis a set of four classes (Lesson1, Arrays, Purse and Flag) that ship with the Krakatoa distribution [56].

So far no tests have been performed with real JavaCard applications, for two reasons: i) the unavailability of JavaCard applets that fit the restrictions imposed by Krakatoa; ii) so far the verification of the Java examples (in theory simpler than real applets) has not been 100% satisfactory, so it did not seem worthwhile to move on to testing more complex applications.

The results of the experiments are shown in Tables 0.1 to 0.4. Table 0.1 shows the number of proof obligations verified, using the original haRVey syntax, with each of the techniques we developed. The first column shows which techniques were applied, the second the number of obligations verified, the third the number of obligations that could not be decided, the fourth how many obligations timed out (with a limit of 30 seconds), and the last the total time taken by the verification (for the obligations that were verified or could not be decided). Table 0.2 shows the size of the theory after applying each of the reduction techniques to each of the problems.

Technique       Verified   Undecided   Timed out   Proof time
normal          14         0           40          1.821
eql             14         0           40          1.632
tailor          14         0           40          1.684
eqf             14         0           40          1.575
eql + tailor    15         2           37          1.917
eqf + tailor    15         2           37          1.862

Table 0.1. Initial results.

Problem    original   tailor   eql   eqf   eql + tailor   eqf + tailor
Lesson1    79         60       55    51    48             44
Arrays     79         59       55    51    47             43
Purse      79         59       55    51    47             43
Flag       76         60       56    51    49             44

Table 0.2. Size of the theories after applying the techniques.

An important point raised during these experiments was a problem with the tool that implements the superposition calculus, the E prover. The E prover uses a series of heuristics to analyze each problem and decide which strategy to use to solve it. During our tests the E prover proved extremely sensitive to variations in the structure of a problem. Sometimes removing an (unnecessary) axiom from the theory would change the heuristic chosen by the E prover, and thus change the answer to the problem (producing time-outs or making a problem appear undecidable).

To assess the impact of these heuristics on our problems, we decided to redo all the experiments with a fixed heuristic. We chose, among the problems that could be verified, the heuristic that had been used most often, and forced the E prover to always use it, regardless of the problem. The result can be seen in Table 0.5.

As can be seen, the fixed heuristic did give us better results (about 20% better than letting the E prover choose the heuristics). These results show that much work remains to be done, either on a better integration of the E prover with haRVey or on the criteria the E prover uses to choose its heuristics.

Technique       Verified   Undecided   Timed out   Proof time
normal          24         1           29          2.340
eql             23         2           29          2.243
tailor          24         1           29          2.302
eqf             23         2           29          2.208
eql + tailor    23         2           29          2.253
eqf + tailor    23         2           29          2.218

Problem    original   tailor    eql       eqf       eql + tailor   eqf + tailor
Lesson1    5 - 6      5 - 6     3 - 4     2 - 3     3 - 4          2 - 3
Arrays     15 - 18    15 - 18   12 - 15   10 - 13   12 - 15        10 - 13
Purse      14 - 30    14 - 30   11 - 27   10 - 23   11 - 27        10 - 23
Flag       13 - 23    13 - 23   12 - 22   9 - 18    12 - 22        9 - 18

Table 0.4. Size of the theories using the structured theory.

Technique       Verified   Undecided   Timed out   Proof time
normal          29         4           21          41.757
eql             28         5           21          41.861
tailor          29         4           21          42.354
eqf             28         5           21          27.830
eql + tailor    28         5           21          42.771
eqf + tailor    28         5           21          27.946

Table 0.5. Results obtained with the fixed heuristic.
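The text does not say which rows its percentages compare, so the following arithmetic is only one plausible reading, using the "normal" rows of the three results tables: 14 obligations verified in Table 0.1, 24 in the results table that precedes Table 0.4, and 29 in Table 0.5. The function and variable names are illustrative.

```python
def relative_gain(before: int, after: int) -> float:
    """Relative increase from `before` to `after`, as a percentage."""
    return 100.0 * (after - before) / before


verified_initial = 14  # "normal" row, Table 0.1
verified_second = 24   # "normal" row, results table preceding Table 0.4
verified_fixed = 29    # "normal" row, Table 0.5

gain_over_second = relative_gain(verified_second, verified_fixed)
gain_over_initial = relative_gain(verified_initial, verified_fixed)
```

Under this reading, going from 24 to 29 verified obligations is a gain of about 20.8%, in line with the "about 20%" quoted for the fixed heuristic, and going from 14 to 29 slightly more than doubles the number of verified obligations.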

0.6 CONCLUSIONS AND FUTURE WORK

Our work concentrated on the study of techniques for theory reduction. These techniques proved extremely valuable in the cases studied, and we believe they should be considered a vital part of any setting that involves large theories (which is commonly the case). Using them gave us gains of up to 100% in the number of obligations verified, and theories that were, on average, five times smaller than the originals.

These results also served to demonstrate that fully automatic verification of Java programs specified in JML is possible using haRVey. Our studies also revealed a problem with the theorem prover used by haRVey (the E prover [74, 75, 76]) regarding the choice of heuristics for the verification process. This problem shows the need for a more detailed study in cooperation with the authors of the E prover.

0.6.1 Future work

The next steps needed to make the tool set as automatic and complete as possible include:

• A better scheme for determining heuristics;

• Adding to Krakatoa the support needed to fully cover the syntax of Java and JML;

Additionally, we plan to study (and possibly include in haRVey) a partitioning-based technique, presented in [1, 55], that divides the axioms of a given theory into subsets according to the signature of each axiom. Each of these subsets is added to the proof obligation only when needed, so that we always work with a minimal set of axioms.
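The following Java sketch illustrates one possible reading of signature-based partitioning. It is not the algorithm of [1, 55] and it is not part of haRVey. The class name AxiomPartitioner, its methods, and the representation of an axiom as a name mapped to its set of symbols are assumptions made for illustration. Axioms that share a symbol are placed in the same subset, and a subset is kept only if one of its axioms mentions a symbol of the proof obligation.

```java
import java.util.*;

/** Hypothetical helper: axioms are identified by name and described by their symbols. */
final class AxiomPartitioner {

    /** Groups axioms so that two axioms sharing a symbol end up in the same subset. */
    static List<Set<String>> partition(Map<String, Set<String>> signatureOf) {
        List<Set<String>> parts = new ArrayList<>();
        Set<String> seen = new HashSet<>();
        for (String start : signatureOf.keySet()) {
            if (!seen.add(start)) continue;          // already placed in a subset
            Set<String> part = new TreeSet<>();
            Deque<String> todo = new ArrayDeque<>();
            todo.add(start);
            while (!todo.isEmpty()) {
                String ax = todo.poll();
                part.add(ax);
                // Pull in every unseen axiom whose signature overlaps with this one.
                for (String other : signatureOf.keySet()) {
                    if (!seen.contains(other)
                            && !Collections.disjoint(signatureOf.get(ax), signatureOf.get(other))) {
                        seen.add(other);
                        todo.add(other);
                    }
                }
            }
            parts.add(part);
        }
        return parts;
    }

    /** Keeps only the subsets in which some axiom mentions a symbol of the obligation. */
    static Set<String> relevantAxioms(Map<String, Set<String>> signatureOf,
                                      Set<String> goalSymbols) {
        Set<String> kept = new TreeSet<>();
        for (Set<String> part : partition(signatureOf)) {
            for (String ax : part) {
                if (!Collections.disjoint(signatureOf.get(ax), goalSymbols)) {
                    kept.addAll(part);
                    break;
                }
            }
        }
        return kept;
    }
}
```

In this sketch a subset is either taken whole or left out. An incremental variant, which adds subsets one at a time only after a proof attempt fails, would be closer to the "only when needed" behaviour described above.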


INTRODUCTION

The use of formal verification methods can be traced back to the early days of computer science, with the seminal works of Hoare. The main objective of such methods is to verify whether a “model” of a system (be it hardware or software) satisfies certain conditions, that is, whether it has a given property. They differ from traditional verification in that they convey a promise of mathematical certainty, whereas “normal” verification is usually based on testing (or simulation, when dealing with hardware). With the increasing size of systems, the use of tests and simulations has become prohibitively expensive. Current systems are so large that exhaustive coverage is practically impossible. Moreover, tests can only reveal the presence of bugs, not prove their absence. To accomplish a full verification of a system, formal (and automated) verification has therefore become extremely important. Such verification has become synonymous with “computer-aided verification”, where the computer is used to speed up the steps of a verification. In general, formal verification may be viewed as theorem proving in a given logic but, in practice, it is divided into several subcategories, which offer the user a trade-off between expressiveness and automation. These subcategories (and their relative levels of expressiveness and automation) can be seen in Figure 1.1.

Figure 1.1. Level of automation/expressiveness (inspired by [46]).

While the verification of models by hand offers the greatest expressiveness, it is clear that, even for small systems, this approach cannot be used effectively. Next after the manual approach in level of expressiveness come proof assistants. Such tools, while not fully automated, can automate some of the steps necessary in a verification. If one wishes full automation, at the expense of expressiveness, one may choose between theorem provers and model checkers. The main difference between them is that, while the latter offer a great degree of automation, they can only work on finite systems, whereas the former can also work on infinite ones.

While formal methods for software verification have been around for many years, offering the possibility of showing that a piece of software is free of errors, they generally came at a high cost, demanding considerable mathematical skills from developers and costing many additional hours of development time to assist with discharging proof obligations [14]. Today this scenario has changed: with the advances in formal verification, the rise in the raw power of computers and the fall in their prices, the costs of software verification are now balanced by the benefits it brings.

But, even with these advances, only a small portion of the market is using the full power of formal verification. Some of the main reasons for this are the belief that only people with considerable mathematical skills can write formal specifications, and the lack of robust tool support. The main fields of the industry that are adopting formal methods are those where the correctness of the software is critical, as, for example, nuclear plants, air control systems and embedded systems.

1.1 EMBEDDED SYSTEMS AND FORMAL VERIFICATION

Embedded systems usually contain small pieces of software that are used as part of a larger system (such as the brake system of a car, a cell phone and so on). Systems of this kind are, by nature, more sensitive than the systems we are used to working with. First, they cannot be “patched” after they have been delivered, or, even if they can, doing so would be extremely expensive. They need to be right at the first delivery. Moreover, it is not enough for them to be “free of bugs” in the traditional sense that no bugs have been found so far: these systems need to be completely trustworthy, in the sense that it is (almost) impossible for them to behave incorrectly.

One special case that is getting increasing attention is the smart card. A smart card is a credit-card-like piece of plastic with memory and a microprocessor on it. Smart cards are currently used in various countries for storing information about their users, as pre-paid cards, and in cell phones. A problem that smart cards faced at the beginning was that, while there were various standards for the physical layer of the card, there was no standard for the software layer, which led to a lack of reuse and to the need to start every new project from scratch. One of the technologies responsible for overcoming this is the JavaCard standard. JavaCard is a subset of the Java language that allows developers to write smart card applications at a high level, with all the benefits of the Java language (security, interoperability, etc).


statements about the programs, and, as JML is a formal language, we can use it to make automatic verifications. Some tools have already been developed with this objective in mind, like Krakatoa, LOOP, Jack, ESC/Java, etc.

The downside of most of these tools is that they usually rely on a proof assistant to reason about the programs. While this gives them a high degree of correctness, verifying JML specifications with proof assistants can be a very tedious and time-consuming task. Other approaches (like ESC/Java) try to bring a fully automated decision procedure to the JML world, but to do so they allow themselves to be less “precise” than the other tools: they do not guarantee that all problems will be found, and they may return false positives.

1.2 OUR APPROACH

In [15] we carried out an early study comparing two tools: ESC/Java [31] (which uses the Simplify [17] theorem prover) and Krakatoa (using the Coq proof assistant [80, 36, 4]). Both tools were used to specify a program written for SASHIMI [40, 41, 42] (a tool that automatically generates embedded systems from Java programs). While the former is characterized as an Extended Static Checker (i.e. a tool to automatically find common mistakes in the source code), the latter is a tool for static verification of the code that relies on a proof assistant.

In our experiments we found strong and weak points in both tools. The first thing to note is that ESC/Java is an extended static checker and not a static checker, so, as a design decision to increase the automation of the tool, it was left unsound and incomplete. The drawback is that we may get many false positives, and some errors may also be left undetected. On the other hand, (part of) the Krakatoa approach aims at a sound and complete result, but with a limited level of automation. So neither tool was, at that time, delivering what day-to-day users would need: reliability and automation.

The objective of our work is therefore to start the development of techniques that would allow us to verify JML specifications of JavaCard applets fully automatically and with a great degree of confidence. To do this we plan to use the automated theorem prover haRVey. Previous results obtained with it show that it can be used in very different environments while still reaching a notable level of success. These results were an encouraging motivation to use haRVey to verify JML specifications, especially when we consider the benefits of automatically discharging proofs without the need for human interaction. To accomplish this, we have used the Krakatoa tool to build proof obligations to be verified with haRVey. Krakatoa is a tool that generates proof obligations for various theorem provers and proof assistants from Java programs and their JML specifications. Krakatoa was our tool of choice because Why, the tool responsible for generating the various outputs for Krakatoa, already supported haRVey, so the transition would be less problematic.


reduction of the size of the theory would allow many problems that could not be verified to be checked, since it would reduce the search space of the theorem prover. Even problems that can already be verified would benefit from a speed-up in verification time when the background theory is reduced. In our experiments, we discovered that haRVey had trouble proving some basic Java classes that come with the Krakatoa distribution. This happened because the Krakatoa/Why approach generates proof obligations with a theory composed of a very large set of axioms (most of which are not used). So our work has so far focused on developing techniques to provide such a sound reduction of the background theory. We will present here three techniques we have developed to accomplish this, and how they influence the verification of the problems we had at hand.

1.3 NEXT CHAPTERS

The next chapters of this work are divided as follows.

Chapter 2 gives a brief introduction to the history and concepts behind the functionality of smart cards in general and JavaCard specifically, and shows why it is the technology of choice for so many developers;

Chapter 3 presents the JML language, which is used to specify Java/JavaCard programs. We will show its underlying concepts, such as design by contract and the two-tiered approach to specification, as well as its syntax;

Chapter 4 presents the tools we are working with, namely Krakatoa, Why and haRVey, describing their technology and basic functionality;

Chapter 5 shows the work we have developed so far, presenting the algorithms we have used to reduce the size of the theory of the Java examples, as well as the results we have obtained with each of these approaches;


JAVA BASED APPROACH TO SMART CARDS

Before the appearance of JavaCard all the smart card companies had to design their own proprietary solutions for their cards. With the use of JavaCard, various possibilities were brought, such as the ability to have more than one application running on the same card, interoperability between all manufacturers that use JavaCard and the possibility to dynamically add new functionalities to the card after it is issued (post-issuance). Another major impact of the use of JavaCard is the possibility to use all the power of the object orientation paradigm during the development cycle, and having available the already established base of Java programmers.

In this section, we will present the basic ideas behind the concept of the smart cards in general and of JavaCard in particular, showing the impact and the importance that such language is having on the growth of the “mobile computing” technologies.

2.1 SMART CARDS

2.1.1 History

Today one of the great goals in the computing field is mobile computing, which tries to bring the usefulness of computers on the go. Some obvious examples are PDAs, laptops and smartphones, which offer their users the ability to carry their most important programs and data wherever they are. But one of the most widespread means of mobile computing is the smart card. Smart cards differ from the more traditional mobile devices in that they work locally, keeping the information and programs always accessible to the user [45]. Smart cards are already widely used in Europe and the USA, where they are employed to store information about the medical history or the bank account of the owner. But smart card proponents aim higher: the card would be a centralized medium of information, storing all the data about its owner (not only medical and bank information, but also driver's license, ID, etc), so that eventually a single card would replace the half-dozen cards that we carry in our wallets nowadays.

The term smart card usually refers to a plastic card (the size of a credit card) with a chip that holds a microprocessor and a data storage unit. The development of smart cards started back in the 1960s: in 1968, two German inventors patented their idea of using plastic cards as a carrier for microchips, but it was not until 1976 that the industry was capable of producing such cards at an acceptable price. In 1981 the first field experiments were conducted around the globe with bank, telepayment and identification cards, among other projects. After that, smart card usage spread fast, with companies like MasterCard, Visa and France Télécom adopting it. Today, smart cards are used in a wide variety of applications ranging from building access systems to electronic payment schemes, from conditional access methods for satellite TV to electronic signature applications, and from loyalty programs to public transportation applications [38], not to mention their use in current GSM mobile phones in the form of SIM cards [66].

As defined in [73] the smart cards can fit into three possible categories, these being:

Memory cards are cards that possess a chip that can store an average of 4KB of information, but do not have a processor. Because of the absence of a processor, they must be used with an external reader that accesses the data and performs the required operations. Memory cards are typically used as pre-paid disposable cards, like phone cards;

Optical memory cards look like a CD glued on top of a card and can store an average of 4MB of data. As with CD-Rs, once the data is written it cannot be erased. Their main use is to keep non-changeable information, like medical or driving records;

Microprocessor cards are the ones that are really “smart”, since they do not have just a memory chip, but also a processor that can perform operations on the data stored on the card.

In the rest of our work, we will be focusing on microprocessor cards, and whenever we say smart card we will be referring to them. In the next sections we will present some more details about this kind of card, showing its components, its limitations and the standards that guide the industry of smart cards.

2.1.2 Characteristics of smart cards

A microprocessor smart card differs from the traditional magnetic stripe card and from the memory card in that, while the latter two are used just to store information, the former can perform computations on that information. The microprocessor card can add, delete, and manipulate information on the card, while a memory-chip card can only undertake a pre-defined operation [38, 73].

The main components of a smart card are: an operating system that controls access to the card’s data and functions; data stored in EEPROM; and RAM for transient results. Traditionally, the CPU of choice was an old 8-bit processor, but nowadays some cards already ship with 16- or 32-bit RISC or CISC CPUs [19]. Some smart cards also offer additional features, such as fingerprint sensors, wireless interfaces, or even displays. Some cards do not even look like a card, such as the JavaRing [38]. The main constraints we face when dealing with these devices are the low-power CPU, the low-throughput serial I/O and the small memory (typically 1-4 KB RAM, 32-128 KB ROM and 16-64 KB Flash RAM) [18].


may be used in authorization process, such as building access. They can also be contact or contactless. Contact smart cards work by communicating via physical contact between a card reader and the smart card. Contactless smart cards communicate using a radio frequency signal, with a typical range of less than 2 feet [66].

Smart cards do not contain any battery and become active just when inserted in a card reader. When in the reader, the smart card initializes itself and stays in a passive mode waiting to receive a command from the reader (these commands are transferred using the APDU protocol that will be presented in Section 2.1.4). After it receives the command, it makes the necessary processing and sends back the result to the reader.

The typical participants in the life cycle of a smart card are illustrated in Figure 2.1, and are [18]:

Semiconductor manufacturers are in charge of chip design and mass production;

Smart card manufacturers are associated here with smart card software producers and are responsible for embedding issuers’ requirements;

Card issuers traditionally have more business/behavioral considerations while deploying and managing smart card-based solutions;

Service providers design and deploy (under the control of issuers) value-added services; and

Users benefit from those services.

Figure 2.1. Smart card production life cycle (from [18]).

The first smart cards were developed as monolithic systems, that is, all the requirements for their functionality were already embedded during manufacturing and there was no possibility of reuse. This approach incurred high maintenance costs and, over the years, developers came to see that the best approach was to make the cards as modular as possible and to allow post-issuance, that is, adding new functionality to the card after it has been shipped. Cards of this kind usually rely on a virtual machine, both for portability (a single application can be loaded into several different cards, i.e. cards relying on different hardware, without needing to be modified) and for security (it is usually easier to prove or ensure safety with intermediate code) [18]. With these ideas in mind, the life cycle of the smart card software can be seen in Figure 2.2, with each stage described as follows [18]:


Init/Instantiate: is done after the download, depending on the nature of the application (ready-to-run or ready-to-load applications). With this, the application registers itself on the card, and makes any necessary initializations (since the card is always running it is done only once);

Use: after that the application is ready for use by the client.

Figure 2.2. Life cycle of the smart card software (from [18]).

2.1.3 Standards

One of the best-known obstacles to the wide acceptance of smart cards was the lack of a standard. When smart cards started being mass-marketed, an application developed to run on one card could not run on another without being rewritten from scratch. This is a serious problem: if smart cards are really intended to be used as a single device for all companies, there should be standardization both of the cards and of the software/platform that runs on them.

The main standards used today come from the International Organization for Standardization (ISO). These standards govern nearly all smart card features, from physical characteristics to application management. They are grouped under the name ISO 7816-x and cover physical characteristics (ISO 7816-1); contact location and dimensions (ISO 7816-2); and electrical signals along with low-level transport (ISO 7816-3) and high-level application (ISO 7816-4) communication protocols (the APDU protocol that will be addressed later). ISO 7816 also addresses other issues such as numbering systems and registration procedures for smart card applications, tag-length-value data structures, enhanced smart card commands (mutual authentication, SQL access, and encryption), and more [38, 18, 73].

But ISO 7816 is not the only standard on the market. There are others, such as those from the European Telecommunications Standards Institute, the Groupe Spécial Mobile subscriber identity module (GSM SIM), the EMV Payment System specification by Europay, MasterCard, and Visa, Visa’s Open Platform specification and the international Common Electronic Purse Standard (CEPS) [38, 73].

2.1.4 The APDU protocol


former is sent by the reader to the card when requesting an operation, and the latter is sent by the card to the reader in response to the request (if no error occurs the card should always send a response APDU, even if it contains no data) [32].

The structure of the command APDU is shown in Figure 2.3. As seen in the figure, the first four fields are always mandatory and their representation is as follows [66]:

CLA (1 byte, required): identifies an application-specific CLAss of instructions. Valid CLA values are defined in the ISO 7816-4 specification:

0x0n, 0x1n: ISO 7816-4 card instructions, such as for file access and security operations

0x20 to 0x7F: Reserved

0x8n or 0x9n: ISO/IEC 7816-4 format you can use for your application-specific instructions, interpreting ’n’ according to the standard

0xAn: Application- or vendor-specific instructions

0xB0 to 0xCF: ISO/IEC 7816-4 format you can use for application-specific instructions

0xD0 to 0xFE: Application- or vendor-specific instructions

0xFF: Reserved for protocol type selection

INS (1 byte, required): indicates a specific INStruction within the instruction class identified by the CLA field (a table showing the possible INS when CLA is 0x0n is shown in [66]).

P1 (1 byte, required): defines a Parameter (1) for the instruction, used to qualify the INS field, or for input data.

P2 (1 byte, required): defines an additional Parameter (2) for the instruction, used to qualify the INS field, or for input data.

Figure 2.3. Command APDU (from [66]).

The next three fields are optional and are used when there is need to exchange data with the card. They are defined as follows:

Lc (1 byte, optional): the Length (in bytes) of the data field of the command;


Le (1 byte, optional): the maximum Length (in bytes) of the data field of the expected response.
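The field layout described above can be sketched in plain Java. This is only an illustration of the byte layout CLA INS P1 P2 [Lc Data] [Le] with one-byte length fields. It is not the javacard.framework.APDU class, and the class name CommandApdu and its members are assumptions made for this example.

```java
import java.util.Arrays;

/** Plain-Java sketch of a short command APDU (hypothetical class, not the JavaCard API). */
final class CommandApdu {
    final int cla, ins, p1, p2;   // mandatory header, one byte each
    final byte[] data;            // optional data field; its length is sent as Lc
    final Integer le;             // optional expected response length, null when absent

    CommandApdu(int cla, int ins, int p1, int p2, byte[] data, Integer le) {
        if (data.length > 255) {
            throw new IllegalArgumentException("a one-byte Lc allows at most 255 data bytes");
        }
        this.cla = cla & 0xFF;
        this.ins = ins & 0xFF;
        this.p1 = p1 & 0xFF;
        this.p2 = p2 & 0xFF;
        this.data = data.clone();
        this.le = le;
    }

    /** Serializes the command as CLA INS P1 P2 [Lc Data] [Le]. */
    byte[] toBytes() {
        int len = 4 + (data.length > 0 ? 1 + data.length : 0) + (le != null ? 1 : 0);
        byte[] out = new byte[len];
        out[0] = (byte) cla;
        out[1] = (byte) ins;
        out[2] = (byte) p1;
        out[3] = (byte) p2;
        int pos = 4;
        if (data.length > 0) {
            out[pos++] = (byte) data.length;               // Lc
            System.arraycopy(data, 0, out, pos, data.length);
            pos += data.length;
        }
        if (le != null) {
            out[pos] = (byte) le.intValue();               // Le
        }
        return out;
    }

    /** Parses the same layout back into its fields. */
    static CommandApdu parse(byte[] raw) {
        if (raw.length < 4) {
            throw new IllegalArgumentException("the header needs 4 bytes");
        }
        byte[] data = new byte[0];
        Integer le = null;
        int rest = raw.length - 4;
        if (rest == 1) {
            le = raw[4] & 0xFF;                            // header followed by Le only
        } else if (rest > 1) {
            int lc = raw[4] & 0xFF;
            if (rest != 1 + lc && rest != 2 + lc) {
                throw new IllegalArgumentException("Lc does not match the body length");
            }
            data = Arrays.copyOfRange(raw, 5, 5 + lc);
            if (rest == 2 + lc) {
                le = raw[5 + lc] & 0xFF;
            }
        }
        return new CommandApdu(raw[0], raw[1], raw[2], raw[3], data, le);
    }
}
```

For instance, new CommandApdu(0x80, 0x30, 0x00, 0x00, new byte[] {0x64}, null).toBytes() yields the six bytes 80 30 00 00 01 64. The CLA and INS values used here are arbitrary and do not belong to any real applet.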

The response APDU format is given in Figure 2.4 and has two required fields and one optional field. Their meanings are the following:

Data field (variable length, determined by Le in the command APDU, optional): contains the data returned by the applet.

SW1 (1 byte, required): the Status Word 1.

SW2 (1 byte, required): the Status Word 2.

The value of the status words depends on the result of the computation, indicating whether the operation terminated normally or an error occurred. The possible values for SW1 and SW2 are given in Figure 2.5.

Figure 2.4. Response APDU (from [66]).

Figure 2.5. Status Words response code (from [66]).
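As a small illustration of the response format, the plain-Java sketch below splits a raw response into its optional data field and the two trailing status bytes. The class name ResponseApdu and its methods are assumptions made for this example, not JavaCard API. Only a handful of well-known ISO 7816-4 status words are decoded, and Figure 2.5 remains the reference for the full set.

```java
import java.util.Arrays;

/** Plain-Java sketch of a response APDU: optional data followed by SW1 and SW2. */
final class ResponseApdu {
    final byte[] data;
    final int sw1, sw2;

    ResponseApdu(byte[] raw) {
        if (raw.length < 2) {
            throw new IllegalArgumentException("SW1 and SW2 are mandatory");
        }
        data = Arrays.copyOfRange(raw, 0, raw.length - 2);
        sw1 = raw[raw.length - 2] & 0xFF;
        sw2 = raw[raw.length - 1] & 0xFF;
    }

    /** Combines SW1 and SW2 into a single 16-bit status word. */
    int statusWord() {
        return (sw1 << 8) | sw2;
    }

    /** 0x9000 is the status word for normal completion. */
    boolean isSuccess() {
        return statusWord() == 0x9000;
    }

    /** Decodes a few common status words; everything else is reported generically. */
    String describe() {
        switch (statusWord()) {
            case 0x9000: return "normal completion";
            case 0x6700: return "wrong length";
            case 0x6D00: return "INS not supported";
            case 0x6E00: return "CLA not supported";
            default:
                return sw1 == 0x61 ? sw2 + " more response bytes available" : "other status";
        }
    }
}
```

A reader-side program would typically check isSuccess() before looking at the data field, since on an error the response usually carries only the two status bytes.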


2.2 JAVACARD

2.2.1 History

The adoption of the Java technology in the smart card market has been pushed over the years because of the advantages that it would bring, such as giving the ability to the well established Java community to quickly start developing smart card applications, the possibility to download new “applets” (as JavaCard programs are known) to an already issued card (post-issuance), not to mention Java portability and security. These factors have made Java the language of choice to extend the smart cards possibilities, even considering that it is an interpreted language and could slow down the system [3, 73].

The history of JavaCard [3, 73] begins in 1996, when the first steps to bring Java technology to smart cards were taken, with the first JavaCard API specification being released by Sun based on work done by Schlumberger [3, 62]. This first JavaCard specification was limited to a description of the general goals and the architecture of JavaCard. In the following year Gemplus and Schlumberger joined forces and founded the JavaCard Forum. In that same year smart card manufacturers such as De La Rue, Bull, and Giesecke & Devrient (G&D) joined the forum, and version 2.0 of the JavaCard specification was released at the end of 1997. This new version was much more detailed: it said more about the API, such as how to access the underlying memory and cryptographic functions, and about the JavaCard Virtual Machine (JCVM), a smaller and simpler JVM that sheds many of standard Java’s features. Even so, the first release of version 2 was still incomplete, in the sense that many details were missing, leaving most design decisions in the hands of the developers. As a result, many applets were still incompatible, even at source code level [3].

In 1998 Visa introduced the Visa Open Platform (VOP) smart card specification, defining an architecture for managing applications on multi-application smart cards (that is, the possibility to download and install an applet in a secure way). From the beginning this platform was aimed at JavaCard products and designed with security in mind. When, in 1999, Visa opened this standard and renamed it Open Platform 2.0, neither Sun nor the JavaCard Forum had specified any security platform, so Visa’s Open Platform became the de facto standard for the matter [3].

In 1999, when Sun introduced the JavaCard 2.1 specification, it offered support for byte code verification using a code signing scheme [3]. Even then Sun did not claim to trust this method completely, and introduced software firewalls. With these, objects are explicitly associated with their owning applets, and additional checks are done in the JCVM to ensure proper access rights for each object. Security was also a concern in this version, with the cryptographic API being extended with numerous classes. Also, for the first time, any applet could run on any smart card compliant with the JC2.1 specification using Converted Applets (CAP).


continue to be the dominating technology with respect to high-end microprocessor-based smart cards [33]. To follow this increasing demand there is already work on the specification of JavaCard 3.0, which aims to bring JavaCard technology closer to standard Java. The objective is to make better use of new hardware technologies (32-bit processors), to enable wider development of JavaCard by developers already familiar with standard Java and to bypass the bottleneck of the specific smart card ISO 7816 communication protocols [33].

2.2.2 JavaCard Overview

A JavaCard is a regular smart card in the sense that it conforms to all the smart card standards; the only difference is that it has a Java Virtual Machine implemented in its ROM. The JCVM (JavaCard VM) controls access to all smart card resources, such as memory and I/O, and thus essentially serves as the smart card’s operating system. The JCVM is able to run applets written in a subset of the full Java language. Since the processing and storage power of smart cards is far below that of desktops, some restrictions had to be made in the JavaCard design, and some features present in standard Java were left out. The main unsupported components, for JC2.0, as stated in [79, 66], are:

Dynamic Class Loading: A JavaCard system is not able to load classes dynamically. Objects are created in the card either into the card’s ROM during manufacturing or through post-issuance;

Security Manager: The security model of JavaCard systems differs from standard Java in fairly significant ways. Language security policies are implemented by the virtual machine. There is no Security Manager class that makes policy decisions on whether to allow operations;

Threads: The JavaCard Virtual Machine does not support multiple threads of control;

Cloning: JavaCard does not support cloning of objects. JavaCard’s version of class Object does not implement a clone() method, and there is no Cloneable interface provided;

Garbage Collection & Finalization: Usually the JCVM does not implement a garbage collector, since objects once created will live “forever” in the card. Explicit deallocation of objects is forbidden, as in the regular version of Java. The method finalize() of the classes will not necessarily be called, and thus developers should not rely on this behavior.


The main package of the JavaCard API is javacard.framework. This package defines the interfaces, classes, and exceptions that compose the core JavaCard Framework. It defines important concepts such as the Personal Identification Number (PIN), the Application Protocol Data Unit (APDU), the JavaCard applet (Applet), the JavaCard System (JCSystem), and a utility class. It also defines various ISO7816 constants and various JavaCard-specific exceptions. Other packages related to JavaCard are java.io, java.lang, java.rmi and javacard.security. A complete list of these classes can be seen in [66].

The VM for the JavaCard platform is implemented in two parts, one external to the card and the other running inside the card itself. The on-card virtual machine is the JavaCard Runtime Environment (JCRE) and is responsible for interpreting the bytecode, managing classes and objects, and so on. The external part of the VM is a development tool, typically referred to as the JavaCard Converter tool, which loads, verifies, and further prepares the Java classes in a card applet for on-card execution. The output of the converter tool is a Converted Applet (CAP) file, which contains all the classes in a Java package in a loadable, executable binary representation. The converter is responsible for verifying if classes conform to the JavaCard specification [66].

After an applet has been developed and converted to the CAP format it is ready to be loaded into the card. The life cycle of an applet after loading in the card can be expressed by the methods it uses to communicate with the JCVM and the external world. Those methods are install, register, select, deselect and process. The install method is called when the applet is first downloaded in the card and its main function is to create an instance of the applet that will be used for the rest of the life of the card. The install function must call the method register, which is responsible for registering the applet inside the card, by assigning an AID (Applet IDentifier) to this instance. The AID is a unique number that identifies each applet inside a card [32].

The remaining methods deal with the communication of the card with the external world. As JavaCard is an environment with multiple applications, each applet must be told explicitly when the communication starts and finishes. This is done using the methods select and deselect. When the reader wants to communicate with a certain applet, its select method is called and it must make any preparations needed to start communicating. When the communication ends, the deselect method is called and the applet is no longer needed. It is important to note that the applet is not destroyed and recreated at each call. The only time an object may be created in the card is during the execution of the install method, and then it is never destroyed. The communication itself is carried out by the method process, which takes as parameter an APDU object coming from the reader. The applet receives this APDU, processes it to find out which actions it needs to perform, performs them and then sends a response to the reader using the method sendBytes of the APDU class.
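The life cycle just described can be imitated off-card with a few plain-Java classes. The sketch below is a simulation only: SimApplet, CounterApplet and SimRuntime are hypothetical names, AIDs are represented as strings, and install and register are merged into a single call. None of this is the javacard.framework API. It only shows that one instance per AID is created once, and that its state survives being deselected and selected again.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for an applet (the real base class is javacard.framework.Applet). */
abstract class SimApplet {
    boolean selected;
    void select()   { selected = true; }
    void deselect() { selected = false; }
    /** Receives a raw command APDU and returns a raw response APDU. */
    abstract byte[] process(byte[] commandApdu);
}

/** Toy applet that counts the commands it has processed. */
final class CounterApplet extends SimApplet {
    private byte count;   // persists for the whole life of the simulated card

    @Override
    byte[] process(byte[] commandApdu) {
        count++;
        return new byte[] {count, (byte) 0x90, 0x00};   // data byte, then SW1 SW2
    }
}

/** Hypothetical runtime: one instance per AID, commands go to the selected applet. */
final class SimRuntime {
    private final Map<String, SimApplet> registry = new HashMap<>();
    private SimApplet current;

    /** Plays the role of install plus register: the instance is stored once and kept. */
    void install(String aid, SimApplet applet) {
        if (registry.containsKey(aid)) {
            throw new IllegalStateException("AID already registered");
        }
        registry.put(aid, applet);
    }

    /** Deselects the current applet, if any, and selects the one with the given AID. */
    void select(String aid) {
        SimApplet next = registry.get(aid);
        if (next == null) {
            throw new IllegalArgumentException("unknown AID");
        }
        if (current != null) {
            current.deselect();
        }
        current = next;
        current.select();
    }

    /** Forwards a command APDU to the selected applet and returns its response. */
    byte[] send(byte[] commandApdu) {
        if (current == null) {
            throw new IllegalStateException("no applet selected");
        }
        return current.process(commandApdu);
    }
}
```

If two CounterApplet instances are installed under different AIDs, selecting the second one and then the first one again does not reset the first counter, which mirrors the remark above that an applet is not destroyed and recreated at each call.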

2.2.3 A JavaCard example