Future work - An improvement of the K-SVD algorithm with applications in face recognition

As for future work, we suggest applying the proposed technique to other datasets, covering face recognition as well as scene recognition and object recognition, in order to assess its effectiveness in those cases. In addition, other parameter combinations for α can be developed with the aim of significantly improving the K-SVD result when the sparsity value L is high. Since the parameterization process can be costly, we recommend investigating metaheuristics or automatic techniques for this purpose. One possibility along these lines would be to implement a Genetic Algorithm to find suitable combinations for the vector α, or even to optimize the objective function of the dictionary learning problem itself.
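The Genetic Algorithm suggestion can be illustrated with a minimal sketch. It is not an implementation from this work: the function `genetic_search`, its parameters, and the stand-in objective `toy_fitness` are all hypothetical. In practice the fitness would run K-SVD with the candidate vector α and return, for instance, the recovery error; here a simple quadratic replaces that step so the example stays self-contained.

```python
import random


def genetic_search(fitness, dim, pop_size=20, generations=50,
                   bounds=(0.0, 1.0), mutation_scale=0.1, seed=0):
    """Minimize `fitness` over real vectors of length `dim` with a simple GA."""
    rng = random.Random(seed)
    lo, hi = bounds
    # Random initial population of candidate parameter vectors.
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness)
        # Truncation selection: the better half survives unchanged (elitism).
        parents = pop[:pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            # Uniform crossover, then Gaussian mutation clipped to the bounds.
            child = [rng.choice(pair) for pair in zip(a, b)]
            child = [min(hi, max(lo, g + rng.gauss(0.0, mutation_scale)))
                     for g in child]
            children.append(child)
        pop = parents + children
    return min(pop, key=fitness)


def toy_fitness(alpha):
    """Hypothetical stand-in for 'run K-SVD with alpha and return its error'."""
    target = [0.2, 0.5, 0.8]  # assumed optimum, for illustration only
    return sum((a - t) ** 2 for a, t in zip(alpha, target))


best_alpha = genetic_search(toy_fitness, dim=3)
```

Because the better half of each generation is kept unchanged, the best candidate never gets worse from one generation to the next. Since each fitness evaluation would involve a full dictionary learning run, the population size and number of generations would have to be chosen with that cost in mind.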

Analogously to what was done in this work, specific strategies can be developed to improve the classification rate rather than the recovery error. One idea that can be explored in this context is “column discarding”. Once the dictionary has been fitted by the dictionary learning process, the sparse representation x_i of each element of the training set is known to be associated with a set of at most L columns of the matrix D. The label of each vector x_i is also known, so the columns of the dictionary D can be related to the classes of the training set. We hypothesize that a more discriminative sparse representation can be built by removing the columns that are used by training elements of different classes. Columns used by elements of several classes presumably do not carry relevant information about any specific class, so these columns could be removed from the sparse representation. This would sacrifice some of the ability to recover information, but we conjecture that an improvement in the classification rate would be observed.
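The column-discarding hypothesis can be sketched as follows. This is an illustration under stated assumptions, not a tested method: the helper names `shared_atoms` and `discard_columns`, the threshold `max_classes`, and the toy data are all hypothetical. Each sparse code x_i is stored as a row, and a column of D counts as "used" by a training element when its coefficient is nonzero.

```python
def shared_atoms(codes, labels, max_classes=1, tol=1e-12):
    """Return sorted indices of dictionary columns used by more than
    `max_classes` distinct classes."""
    classes_per_atom = {}
    for x, y in zip(codes, labels):
        for k, coef in enumerate(x):
            if abs(coef) > tol:  # column k takes part in representing x
                classes_per_atom.setdefault(k, set()).add(y)
    return sorted(k for k, cls in classes_per_atom.items()
                  if len(cls) > max_classes)


def discard_columns(codes, atoms):
    """Zero the coefficients of the given columns in every sparse code."""
    drop = set(atoms)
    return [[0.0 if k in drop else c for k, c in enumerate(x)] for x in codes]


# Toy data: four training elements, a dictionary with four columns.
codes = [
    [0.9, 0.0, 0.3, 0.0],  # class "a"
    [0.7, 0.0, 0.0, 0.2],  # class "a"
    [0.0, 0.8, 0.4, 0.0],  # class "b"
    [0.0, 0.6, 0.0, 0.0],  # class "b"
]
labels = ["a", "a", "b", "b"]

shared = shared_atoms(codes, labels)     # column 2 is used by both classes
pruned = discard_columns(codes, shared)  # its coefficients are zeroed
```

In this toy case only column 2 is shared between classes, so the pruned codes keep just the class-specific columns. Whether this actually improves the classification rate is exactly the conjecture stated above and would need to be checked experimentally.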

