Resumo das Apresentações Orais Apresentação 1 Título

(1)

Resumo das Apresentações Orais Apresentação 1

Título

Influence Diagnostics in Generalized Symmetric Linear Models.

Autores

Cristian Villegas, Gilberto A. Paula, Francisco José A. Cysneiros, Mauel Galea.

Resumo

The aim of this paper is to introduce generalized symmetric linear models (GSLMs) in the same sense of generalized linear models (GLMs), in which a link function is defined to establish a relationship between the mean values of symmetric distributions and linear predictors. The class of symmetric distributions contains various distributions with lighter and heavier tails than normal and hence ofers a more flexible basis for analyzing symmetric data. An iteratively reweighed least squares (IRLS) algorithm is derived to obtain maximum likelihood estimates. The local influence methodology is applied to study the sensitivity of the maximum likelihood estimates under some usual perturbation schemes, such as case-weight, response variable, continuous explanatory variable and scale parameter perturbations. We also discuss generalized leverage and residual analysis. Finally, an illustration is given in which the methodology developed in this paper is applied to a real data set.

Apresentação 2

Título

Ajuste de Mdelos Lineares Mistos para Experimentos Aleatorizados em Blocos com dois Fatores Longitudinais.

Autores

Marina Rodrigues Maestre, César Gonçalves de Lima, Elias Silva de Medeiros, Rafael Otto.

Resumo

Em estudos de fertilidade de solo é comum realizar medidas repetidas de uma ou mais variáveis respostas na mesma unidade experimental ao longo de duas direções, como por exemplo, tempo e profundidade, ou profundidade e distância de certa origem.

Nesses estudos longitudinais admite-se a presença de heterogeneidade de variâncias nas diversas ocasiões e de correlações não nulas entre as medidas repetidas. Para analisar esse tipo de dados podem-se utilizar modelos (lineares ou não lineares) mistos, que têm o atrativo de explicar de forma conveniente a variabilidade existente entre unidades experimentais (devida aos tratamentos) e as correlações dentro das unidades experimentais (devida as medidas repetidas). O objetivo do presente trabalho é ajustar um modelo para explicar o volume de massa seca de raiz de cana- de-açúcar em função de quatro doses de nitrogênio, de um experimento casualizado em blocos com medidas feitas em diferentes profundidades e distâncias da linha de plantio da cana. Na escolha do melhor modelo utilizou-se o teste da razão de verossimilhanças, o critério de informação de Akaike e gráficos de diagnóstico dos resíduos condicionais padronizados. No modelo final foram incluídos efeitos aleatórios no coeficiente de primeiro grau e no intercepto do polinômio de segundo grau, considerando a estrutura de covariâncias de componentes de variância para as medidas repetidas.

(2)

Título

Análise de Resíduos em Modelos de Tempo de Falha Acelerado com Efeito Aleatório.

Autores

Elisângela da Silva Rodrigues, Dione Maria Valença, Julio da Motta Singer.

Resumo

Apresentamos técnicas de análise de resíduos desenvolvidos para modelos lineares mistos para avaliar o ajuste de dados de sobrevivência correlacionados por meio de Modelos de Tempo de Falha Acelerado (MTFA) com efeito aleatório. Propomos um procedimento de imputação para as informações censuradas e consideramos três tipos de resíduos, nomeadamente,os resíduos marginais, resíduos condicionais e os preditores dos efeitos aleatórios. Ilustramos as propostas de ajuste e de análise de resíduos a um conjunto de dados reais envolvendo tempos entre falhas de equipamentos de subsuperfície de poços de petróleo.

Título

Kalman Filter Estimation for a Regression Model with Locally Stationary Errors.

Autores

Guillermo Ferreira, Alejandro Rodriguez, Bernardo Lagos.

Resumo

In this paper, a methodology for estimating a regression model with locally stationary errors is proposed. In particular, we consider models that have two features: time- varying trends and errors belonging to a class of locally stationary processes. The proposed procedure provides an efficient methodology for estimating, predicting and handling missing values for non-stationary processes. We consider a truncated infinite- dimensional state space representation and, with the Kalman filter algorithm we estimate the parameters of the model. As suggested by the Monte Carlo simulation studies, the performance of the Kalman filter approach is very good, even with small sample sizes. Finally, the proposed methodology is used in two real life applications.

Apresentação 5 Título

Least Square Estimation for a class of Locally Stationary Processes.

Autores

José Piña, Guillermo Ferreira, Wilfredo Palma, Mauricio Zevallos.

Resumo

In this work we derive some asymptotic properties of the least square estimator (LSE) in regression models with time-varying trend and stationary errors. The smoothness of the time-varying-trend guarantees that this model can be locally approximated by stationary processes, leading to a particular class of locally stationary (LS) process. We establish conditions to ensure the consistency of the LSE and provide precise convergence rates for its variance. In addition, we demonstrate that the central limit theorem holds for this estimator. The finite sample behaviour of the LSE is illustrated

(3)

by Monte Carlo simulations. Finally the proposed methodology is applied to the IPC returns.

Título

Processos Pontuais com Heterogeneidade Espacial.

Autores

Jony Arrais Pinto Junior, Marina Silva Paez, Dani Gamerman.

Resumo

Um padrão pontual é um conjunto de pontos que representa as localizações de um determinado evento de interesse. Neste caso, o evento é conhecido e a localização do ponto é que é aleatória. Processos pontuais espaciais são úteis como modelos estatísticos na análise de padrões pontuais. Este trabalho propõe um modelo para padrões de pontos que inclui covariáveis espaciais e individuais, bem como interações entre as mesmas. O modelo proposto é uma extensão do trabalho de Liang et al.

(2008) e permite que o efeito das covariáveis individuais e das interações possam variar ao longo do espaço. A performance do modelo é comparada com o modelo que considera todos os efeitos das covariáveis ?xos em uma aplicação de óbitos cerebrovasculares no município do Rio de Janeiro.

Título

Truncated Normal Nonlinear Regression Models for Soil-water Characteristic Curves.

Autores

Carolina Costa Mota Paraíba, Carlos Alberto Ribeiro Diniz, Aline de Holanda Nunes Maia, Lineu Neiva Rodrigues.

Resumo

In the present paper, we propose an alternative approach for estimating soil-water characteristic curves based on truncated normal nonlinear regression models.

Maximum likelihood estimator of the curve parameters are obtained by direct maximization of the likelihood function. Simulation studies are provided to assess the quality of estimates for the proposed regression model and diagnostic analysis are considered to check for model adequacy. We also provide a comparison study, based on simulation results, between the proposed methodology and the usual nonlinear least squares procedure. A real data set is analyzed using the proposed methodology.

Título

Modeling the Latent Covariate in the Linear Regression Model with Measurement Error Using Finite Mixtures of Skew-Student t Distributions.

Autores

Celso Rômulo Barbosa Cabral, Víctor Hugo Lachos, Camila Borelli Zeller.

Resumo

(4)

In regression models, the classical normal assumption for the distribution of the random observational errors is often violated, masking some important features of the variability present in the data. Some practical actions to solve the problem, like the transformation of variables to achieve normality, are often kept in doubt. In this work we present a proposal to deal with this issue in the context of the simple linear regression model when both the response and the explanatory variable are observed with error. In such models, the experimenter observes a surrogate variable instead of the covariate of interest. We extend the classical normal model by modeling jointly the unobserved covariate and the random errors by a finite mixture of a skewed version of the Student t distribution. This approach allows us to model data with great flexibility, accommodating skewness, heavy tails and multi-modality. We develop a simple EM- type algorithm to proceed maximum likelihood inference of the parameters of the proposed model, and compare the efficiency of our method with some competitors through the analysis of some artificial data.

Título

Modelo de Resposta ao Item com Controle da Heterogeneidade Atribuída a Fatores Conhecidos.

Autores

Rômulo Andrade da Silva, Afrânio Márcio Corrêa Vieira.

Resumo

Uma das pressuposições, no processo de estimação dos parâmetros dos modelos tradicionais de resposta ao item, é a independência entre as respostas de diferentes indivíduos. Porém, muitas vezes essa suposição é relaxada, por exemplo, quando aplicada em larga escala nas avaliações de sistemas educacionais. A proposta é usar potenciais fontes de heteronegeidade, como classes socioeconômicas e origem de escola pública ou privada, geralmente desprezadas, como variáveis explicativas de um efeito aleatório multiplicativo no modelo de Rasch. Esse efeito, consequentemente, acomodará a superdispersão nos dados e controlará a pressuposição de independência condicional entre clusters de alunos. Contudo, o trabalho tem como meta apresentar nova abordagem dos modelos de resposta ao item que estime de forma mais precisa os parâmetros, útil na fase de calibração, levando em conta a heterogeneidade atribuída a fatores conhecidos.

Título

Promotion Time Cure Rate Model with Random Effects: an Application to a Multi-centre Clinical Trial of Carcinoma.

Autores

Diego Gallardo Mateluna, Heleno Bolfarine, Antonio Carlos Pedroso de Lima.

Resumo

This paper extends the cure rate model considered in Lopes and Bolfarine (2012) by considering random effects in both, the probability of cure and the survival function for individuals that are at risk. The model is parametrized in terms of the cured fraction which is then linked to covariates. The estimation is based on the restricted maximum likelihood (REML) approach proposed in McGilchrist and Yau (1995). Simulation

(5)

studies are performed and results based on a real data set are presented indicating good performance of the proposed approach. Keywords: Promotion time cure rate model, Random effects, REML.

Título

A New Non-default Rate Regression Model for Credit Scoring Data: an Application to a Brazilian Bank Personal Loan Portfolio Data.

Autores

Francisco Louzada, Vicente G. Cancho, Gladys D.C. Barriga.

Resumo

In this paper we propose a new no-default rate survival model. Our approach enables diferent underlying activation mechanisms which lead to the event of interest. The number of competing causes which may be responsible for the occurrence of the event of interest is assumed to follow a geometric distribution while the time to event is assumed to follow a inverse Weibull distribution. As an advantage our approach may scan all underlying activation mechanisms from the first to last one based on order statistics. We explore the use of maximum likelihood estimation procedure. Simulation studies are performed and experimental results are illustrated based on a real Brazilian bank personal loan portfolio data.

Título

Estimadores Baseados na Técnica IPCW para Dados de Sobrevivência com Fração de Cura.

Autores

Gisela Tunes, Antonio Carlos Pedroso de Lima.

Resumo

Atualmente, muitos estudos aplicados com dados de sobrevivência envolvem situações em que pode haver um grupo de indivíduos não suscetíveis ao evento de interesse. Os modelos com fração de cura foram desenvolvidos de forma a incorporar a possibilidade de existência de um grupo de pacientes que não experimentarão o evento de interesse (chamados de pacientes curados). Existem propostas na literatura duas principais abordagens para tal situação: modelos de mistura e modelos de risco acumulado limitado. Neste trabalho, consideramos ambas abordagens e, utilizando a técnica IPCW (Inverse Probability of Censoring Weights), propomos equações de estimação para parâmetros de interesse, permitindo a inclusão de covariáveis nos modelos. A metodologia proposta é ilustrada com a análise de dados de pacientes com insuficiência cardíaca do Instituto do Coração (InCor) da Universidade de São Paulo. Um estudo de simulação foi conduzido para avaliar a performance do estimador e avaliou-se a utilização de matriz de correlação de trabalho na equação de estimação com o intuito de melhorar o desempenho do estimador proposto.

Apresentação 13 Título

(6)

Modelling Volatility Using State Space Models with Heavy Tailed Distributions.

Autores

Frank Magalhaes de Pinho, Glaura da Conceição Franco, Ralph dos Santos Silva.

Resumo

This article deals with a non-Gaussian state space model (NGSSM), which is a generalization of the results in Smith and Miller (1986). The NGSSM is attractive because the likelihood can be analytically computed, thus avoiding the use of highly demanding computational algorithms such as the particle lter in order to make inference on the parameters. The paper focuses on stochastic volatility models in the NGSSM, where the observation equation is modelled with a heavy tailed distribution such as Log-normal, Log-gamma and Fréchet. Parameter estimation can be accomplished either using classical or Bayesian procedures and a simulation study shows that both methods lead to satisfactory results. In a real data application, the proposed stochastic volatility models in the NGSSM are compared with the autoregressive conditionally heteroscedastic and stochastic volatility models using South and North American stock price indexes.

Título

A Bayesian Long-term Survival Model with Partitions Structures for Categorical Covariates.

Autores

Jhon F. Bernedo Gonzales, Mário de Castro Andrade Filho, Francisco Louzada, Vera L. Tomazella.

Resumo

In this paper we propose a Bayesian partition modeling for lifetime data in presence of a cure fraction by considering a local structure generated by a tessellation which depends on covariates. In this modelling we including information of nominal qualitative variables with more than two categories or ordinal qualitative variables. The proposed modeling is based on a promotion time cure model structure but assuming that the number of competing causes follow a geometric distribution. It is an alternative modeling strategy to the conventional survival regression modeling generally used for modeling lifetime data in presence of a cure fraction, which models the cure fraction through a (generalized) linear model of the covariates. An advantage of our approach is its ability to capture the effects of covariates in a local structure. The flexibility of having a local structure is crucial to capture local effects and features of the data. The modeling is illustrated on two real melanoma data sets.

Título

Shrinkage Hierarchical Prior Distributions in Multivariate Dynamic Models.

Autores

Cristian Andres Cruz Torres, Helio dos Santos Migon.

Resumo

(7)

In this paper we develop a full Bayesian model for various classes of shrinkage hierarchical prior distributions. We evaluate the relative performance of the alternative full Bayesian model and show an application to dynamic multivariate models. A comprehensive review of the literature is presented and is applied to data from the Brazilian economy. The proposed methods were efficient to detect structural changes in the Brazilian economy during the period analyzed.

Título

Bartlett Corrections in Beta Regression Models.

Autores

Fábio M. Bayer, Francisco Cribari-Neto.

Resumo

We consider the issue of performing accurate small-sample testing inference in beta regression models, which are useful for modeling continuous variates that assume values in (0,1), such as rates and proportions. We derive the Bartlett correction to the likelihood ratio test statistic and also consider a bootstrap Bartlett correction. Using Monte Carlo simulations we compare the finite sample performances of the two corrected tests to that of the standard likelihood ratio test and also to its variant that employs Skovgaard's adjustment; the latter is already available in the literature. The numerical evidence favors the corrected tests we propose. We also present an empirical application.

Título

On the Gradient Statistic under Model Misspecification.

Autores

Artur J. Lemonte.

Resumo

A robust gradient statistic under model misspecification is proposed and its asymptotic distribution under the null hypothesis is determined. Several examples are presented and Monte Carlo simulation experiments are carried out.

Título

The Ecological Footprint of Taylor’s Universal Power Law.

Autores

Bent Jorgensen, Wayne S. Kendal, Clarice G. B. Demétrio, René Holst.

Resumo

In 1961, L. R. Taylor proposed a power law for the relationship between the spatial mean and variance of population abundance in biology, which over time has been observed for a large number of species, as well as in many other epidemiological, social and physical systems. Many disparate explanations for the power law have been

(8)

proposed in the literature, but no consensus on their adequacy has emerged. We review the historical background for the power law and some the controversies surrounding it. We then turn to a possible theoretical explanation based on the power variance functions of the so-called Tweedie distributions, along with a new spatial self- similarity hypothesis. The theory suggests that there are actually two separate power laws in swing, one describing the spatial variance as a function of population density, and the other involving the size of the sampled area. This is confirmed empirically, and in turn leads us to propose a new log-linear spatial modelling framework with long- range dependence, which allows simultaneous estimation of the regression parameters and the two power parameters. We discuss some of the possible ramifications of these ideas for studying animal abundance data and other spatial phenomena exhibiting clustering and long-range dependence.

Título

The Gamma Birnbaum-Saunders Distribution.

Autores

Maria do Carmo S. Lima, Gauss M. Cordeiro, Rodrigo R. Pescim, Marcelino A. R.

Pascoa, Audrey Cysneiros, Edwin M. M. Ortega.

Resumo

We propose a new three-parameter lifetime model called the gamma Birnbaum- Saunders distribution. We derive explicit expressions for the ordinary and incomplete moments, generating and quantile functions, probability weighted moments and mean deviations. The estimation of the model parameters is performed by maximum likelihood. The potentiality of the new model is illustrated by means of some applications to real data.

Título

The Censored Bimodal Skew-normal Model with an Application to Plasma HIV-1 RNA.

Autores

Guillermo Martínez-Flórez, Germán Moreno-Arenas.

Resumo

We introduce the censored bimodal skew-normal model for adjust censored data with bimodality and high levels of skewness and kurtosis. Various structural properties of the new distribution are derived, including its moments. The model parameters are estimated by maximum likelihood and the observed information matrix is derived. An application of the new model to real data (plasma HIV-1 RNA measurement) is presented for illustrative purpose.

Título

A New Weibull Family of Hazard Modesl for Breast Cancer Survival.

Autores

(9)

Gleici Castro Perdonã, Francisco Louzada, Cleyton Zanardo, Hayala Cavenague.

Resumo

In this paper, we discuss a family Weibull Modified of hazard model to breast cancer problematic. The breast cancer is addressed here by the high incidence and lack of knowledge in survival among women worldwide. The model is very flexible, and accommodate several particular cases. Inference procedure is based on maximum likelihood. A simulation study is performed in order to verify the frequentists properties of the maximum likelihood estimation procedure. A real example on breast cancer is addressed.

Título

Binary Regression Analysis with Network Structure of Respondent-driven Sampling Data.

Autores

Leonardo S. Bastos, Claudia T. Codeço, Adriana A. Pinho, Francisco I. P. Bastos.

Resumo

Respondent-driven sampling (RDS) is a procedure to sample from hard-to-reach populations. It has been widely used in several countries, especially in the monitoring of HIV/AIDS and other sexually transmitted infections. Hard-to-reach populations have had a key role in the dynamics of such epidemics and must inform evidence-based initiatives aiming to curb their spread. In this paper, we present a simple test for network dependence for a binary response variable, and we propose a binary regression model taking into account the RDS structure which is included in the model through correlated latent random effects. The proposed model is illustrated in a RDS study for HIV and syphilis in men who have sex with men (MSM) implemented in Campinas (Brazil).

Título

Modelagem Bayesiana Flexível em Regressão com Erros nas Variáveis.

Autores

Nelson Lima de Souza Filho, Celso Rômulo Barbosa Cabral.

Resumo

Em modelos de regressão, o pressuposto clássico de normalidade para a distribuição dos erros aleatórios é muitas vezes violado, mascarando algumas características importantes da variabilidade dos dados. Algumas ações práticas para resolver esse problema, como transformações nos dados, revelam-se muitas vezes ineficazes.

Neste trabalho apresentamos uma proposta para lidar com esta questão no contexto do modelo de regressão multivariada linear simples, quando a variável resposta e a variável regressora são observadas com erro aditivo -- o chamado modelo de regressão linear com erros nas variáveis. Em tais modelos, o pesquisador observa uma variável substituta em vez da covariável de interesse. Nós estendemos o modelo clássico normal, modelando a distribuição conjunta da covariável e dos erros aleatórios por uma mistura finita de densidades pertencentes a uma família de distribuições bem geral, acomodando ao mesmo tempo assimetria, caudas pesadas e

(10)

multimodalidade, permitindo um grau de flexibilidade que não pode ser atingido pelo modelo normal. Para a parte de estimação desenvolvemos um algoritmo do tipo Gibbs para proceder estimação Bayesiana. Alguns modelos propostos foram comparados com modelos simétricos já existentes na literatura, utilizando um critério DIC modificado, através da análise de dados simulados e reais.

Título

A New Testlet model.

Autores

Sandra Flores, Caio Azevedo, Jorge Luis Bazan.

Resumo

Um pressuposto fundamental na Teoria de Resposta ao Item (TRI) é a independência condicional entre as respostas item, mas em muitas situações este pressuposto falha, como acontece na avaliacao da compreensão de leitura, onde uns conjuntos de itens estão relacionados a uma passagem de leitura (testlet). Um modelo TRI de efeito aleatório foi proposto por Bradlow et al. (1999), para superar o problema. Inspirado neste trabalho, propomos um novo modelo Testlet nomeado modelo testlet logística de expoente positivo (LPET), que inclue o modelo testlet logística (LT) como um modelo específico. Este modelo é uma extensão do modelo LPE proposto por Samejima (1999) e Bolfarine e Bazan (2010), que considera curvas caracteristicas assimétricas dos itens (ICC). Estudos de simulação mostram que, quando um conjunto de respostas está em testlets, os parâmetros são mais bem estimados por modelos de Testlet. Além disso, a ICC assimetrica pode ser mais apropriada em algumas situações. Uma aplicacao com um conjunto de dados reales de um teste de compreensão de leitura em escolas primárias do Peru permite concluir que a abordagem é um caminho muito promissor e exível para lidar com as formas testlet do teste.

Título

Teste de Hipótese e Seleção de Variáveis em Regressão não Paramétrica.

Autores

Adriano Zanin Zambom, Michael G. Akritas.

Resumo

Seja X um vetor d dimensional de covariáveis e Y a variável resposta. Sob o modelo não paramétrico Y = m(X) + sigma(X)Epsilon, desenvolvemos um teste baseado em idéias de ANOVA para a hipótese nula que uma coordenada particular de X não tem influência na função de regressão. A distribuição assintótica da estatística do teste, usando resíduos baseados no estimador de regressão Nadaraya-Watson e d < 5, é estabelecida sob a hipótese nula e alternativas locais. Simulações sugerem que sob um modelo esparso, a aplicabilidade do teste se extende para d arbitrário através de redução de dimensão suficiente (Sufficient Dimension Reduction). Usando p-valores deste teste, um método de seleção de variáveis baseado em idéias de múltiplos testes (False Discovery Rate - FDR) é proposto.

(11)

Título

A Hierarchical Model for Aggregated Functional Data.

Autores

Ronaldo Dias, Nancy L. Garcia, Alexandra M. Schmidt.

Resumo

In many areas of science one aims to estimate latent sub-population mean curves based only on observations of aggregated population curves. By aggregated curves we mean linear combination of functional data that cannot be observed individually. We assume that several aggregated curves with linearly independent coefficients are available and each aggregated curve is an independent partial realization of a Gaussian process with mean modeled through a weighted linear combination of the disaggregated curves. The mean of the Gaussian process is modeled using B-splines basis expansion methods. We propose a semi-parametric, valid covariance function, which is modeled as the product of a nonparametric variance function by a correlation function. The variance function is described as the square of a function that is expanded using B-splines basis functions. This results in a non-stationary covariance function and includes constant variance models as special cases. Inference is performed following the Bayesian paradigm allowing experts' opinion, when available, to be accounted for. Moreover, it naturally provides the uncertainty associated with the parameters estimates and fitted values. We analyze artificial datasets and discuss how to choose among the different covariance models. We focus on two different real examples: a calibration problem for NIR spectroscopy data and an analysis of distribution of energy among different types of consumers. In the latter example, our proposed covariance function captures interesting features of the data. Further analysis of different artificial datasets, as well as computer code and data are available as supplementary material online.

Título

Robust Linear Functional Mixed Models.

Autores

Marco Riquelme, Heleno Bolfarine, Manuel Galea.

Resumo

In this work we propose a linear functional model with normal random effects and elliptical errors extending standard normal models considered previously. The corrected score approach (Nakamura, 1990) is used for parameter estimation and the resulting estimators are shown to be consistent and asymptotically normal. The local influence approach (Cook, 1986) is used for assessing influence of small perturbations on parameter estimates. A simulation study is presented illustrating the good performance of the proposed approach, including the robustness property for the heavier tail models. Key Words: Corrected score estimators; Local influence; Elliptical distributions; Linear functional mixed models.