Abstract
We discuss a variety of methods for quantifying categorical multivariate data. These methods have been proposed in many different countries, by many different authors, under many different names. In the first major section of the paper we analyze the many different methods and show that they all lead to the same equations for analyzing the same data. In the second major section of the paper we introduce the notion of a duality diagram, and use this diagram to synthesize the many superficially different methods into a single method.
Similar content being viewed by others
References
Baker, F. B. (1960). Univac scientific computer program for scaling of psychological inventories by the method of reciprocal averages CPA 22.Behavioral Science, 5, 268–269.
Benzécri, J. P. (1973).L'analyse des données: T. 2, l'analyse des correspondances [Data Analysis: T. 2, Correspondence analysis]. Paris: Dunod.
Benzécri, J. P. (1977a). Historie et préhistoire de l'analyse des données: l'analyse des correspondances [History and prehistory of data analysis: correspondence analysis].Les Cahiers de l'Analyse des Données, 2, 9–53.
Benzécri, J. P. (1977b). Sur l'analyse des tableaux binaires associés à une correspondance multiple [The analysis of boolean tables associated with a multiple correspondence].Les Cahiers de l'Analyse des Données, 2, 55–71.
Bock, R. D. (1960).Methods and applications of optimal scaling (Rep. No. 25). Chapel Hill: University of North Carolina.
Bouroche, J. M., Saporta, G., & Tenenhaus, M. (1975, August).Generalized canonical analysis of qualitative data. Paper presented at the U.S.-Japan Seminar on Theory, Methods and Applications of Multidimensional Scaling and Related Techniques, San Diego: University of California.
Burt, C. (1950). The factorial analysis of qualitative data.British Journal of Psychology, 3, 166–185.
Burt, C. (1953). Scale analysis and factor analysis.British Journal of Statistical Psychology, 6, 5–23.
Cailliez, F., & Pagès, J. P. (1976).Introduction à l'analyse des données [Introduction to data analysis]. Paris: Smash.
Carroll, J. D. (1968). Generalization of canonical correlation analysis to three or more sets of variables.Proceedings of the 76th Annual Convention of the American Psychological Association, 3, 227–228.
Cazes, P. (1972).Etude du dédoublement d'un tableau en analyse des correspondances [Analysis of a table and its complementary in correspondence analysis]. Unpublished manuscript, Université Pierre et Marie Curie, Laboratoire de Statistique Mathématique, Paris.
Cazes, P., Baumerder, A., Bonnefous, S., & Pagès, J. P. (1977). Codage et analyse des tableaux logiques. Introduction à la pratique des variables qualitatives [Scaling and analysis of binary tables. Introduction to the practice of qualitative variables].Cahiers du Bureau Universitaire de Recherche Opérationnelle (Série Recherche, Cahier No. 27) Paris: Institut de Statistique des Universités de Paris, Université Pierre et Marie Curie.
Daudin, J. J., & Trecourt, P. (1980). Analyse factorielle des correspondances et modèle log-linéaire. Comparaison des deux méthodes sur un exemple [Correspondence analysis and Log-linear model. Comparison of both models on an example].Revue de Statistique Appliquée, 28, 5–24.
de Leeuw, J. (1973).Canonical analysis of categorical data. Unpublished doctoral dissertation, University of Leiden, Leiden, The Netherlands.
Dempster, A. P. (1969).Elements of continuous multivariate analysis. Reading, MA: Addison-Wesley.
Eckart, C., & Young, C. (1936). The approximation of one matrix by another of lower rank.Psychometrika, 1, 211–218.
Escofier, B. (1979a). Une représentation des variables dans l'analyse des correspondances multiples [Representation of variables in multiple correspondence analysis].Revue de Statistique Appliquée, 27, 37–47.
Escofier, B. (1979b). Traitement simultané de variables qualitatives et quantitatives en analyse factorielle [Simultaneous treatment of qualitative and quantitative variables in factor analysis].Les Cahiers de l'Analyse des Données, 4, 137–146.
Fisher, R. A. (1940). The precision of discriminant functions.Annals of Eugenics, 10, 422–429.
Gifi, A. (1981).Nonlinear multivariate analysis. Leiden, The Netherlands: University of Leiden, Afdeling Datatheorie.
Greenacre, M. J. (1984).Theory and applications of correspondence analysis. London: Academic Press.
Guttman, L. (1941). The quantification of a class of attributes: A theory and method of scale construction. In P. Horst et al. (Eds.),The prediction of personal adjustment. (pp. 319–348). New York: Social Science Research Council.
Guttman, L. (1950). The principal components of scale analysis. In S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeld, S. A. Star, & J. A. Clausen.Measurement and prediction. Princeton: Princeton University Press.
Guttman, L. (1953). A note on Sir Cyril Burt's factorial analysis of qualitative data.British Journal of Statistical Psychology, 6, 1–4.
Guttman, L. (1959). Metricizing rank-ordered or unordered data for a linear factor analysis.Sankhya, 21, 257–268.
Hayashi, C. (1950). On the quantification of qualitative data from the mathematico-statistical point of view.Annals of the Institute of Statistical Mathematics, 2 (No. 1), 35–47.
Hayashi, C. (1952). On the prediction of phenomena from qualitative data and the quantification of qualitative data from the mathematico-statistical point of view.Annals of the Institute of Statistical Mathematics, 3 (No. 2), 69–98.
Hayashi, C. (1954). Multidimensional quantification—with applications to analysis of social phenomena.Annals of the Institute of Statistical Mathematics, 5 (No. 2), 121–143.
Healy, M. J. R., & Goldstein, H. (1976). An approach to the scaling of categorized attributes.Biometrika, 63, 219–229.
Hill, M. O. (1973). Reciprocal averaging: An eigenvector method of ordination.Journal of Ecology, 61, 237–251.
Hill, M. O., & Smith, J. E. (1976). Principal component analysis of taxonomic data with multi-state discrete characters.Taxonomy, 25, 249–255.
Hirshfield, H. O. (1935). A connection between correlation and contingency.Cambridge Philosophical Society Proceedings, 31, 520–524.
Horst, P. (1935). Measuring complex attitudes.Journal of Social Psychology, 6, 369–374.
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 24, 417–441, 498–520.
Kettenring, J. R. (1971). canonical analysis of several sets of variables.Biometrika, 58, 433–451.
Lauro, N. C., & Decarli, A. (1982). Correspondence analysis and log-linear models in multiway contingency tables study: Some remarks on experimental data.Metron, 40, 213–234.
Lebart, L. (1975). L'orientation du dépouillement de certaines enquêtes par l'analyse des correspondances multiples [The orientation of the analysis of some surveys by multiple correspondence analysis].Consommation, 2, 73–96.
Lebart, L., & Fénelon, J. P. (1971).Statistique et informatique appliquées [Applied statistics and informatics]. Paris: Dunod.
Lebart, L., Morineau, A. & Tabard, N. (1977).Techniques de la description statistique [Statistical description technics]. Paris: Dunod.
Lebart, L., Morineau, A., & Warwick, K. M. (1984).Multivariate descriptive analysis: Correspondence analysis and related techniques for large matrics. New York: Wiley-Interscience.
Leclerc, A. (1980) Quelques propriétés optimales en analyse de données en terme de corrélation entre variables [Some optimal properties in data analysis in term of correlation between variables].Mathématique et Sciences Humaines, 18, 51–67.
Levine, J. H. (1979). Joint space analysis of “pick any” data: Analysis of choices from an unconstrained set of alternatives.Psychometrika, 44, 85–92.
Lingoes, J. C. (1963).Multivariate analysis of contingencies: An IBM 7090 program for analyzing metric/non-metric or linear/non-linear data. [Computer program]. Ann Arbor, MI: University of Michigan Computing Center. (Computational Report, 2, 1–24.)
Lingoes, J. C. (1964). Simultaneous linear regression: An IBM 7090 program for analyzing metric/non-metric or linear/non-linear data.Behavioral Science, 9, 87–88.
Lingoes, J. C. (1968). The multivariate analysis of qualitative data.Multivariate Behavioral Research, 3, 61–94.
Lingoes, J. C. (1972). A general survey of the Guttman-Lingoes nonmetric program series. In R. N. Shepard, A. K. Romney, & S. Nerlove (Eds.),Multidimensional scaling: Theory and applications in the behavioral sciences, Vol. 1: Theory (pp. 49–68). New York: Seminar Press.
Lingoes, J. C. (1973).The Guttman-Lingoes nonmetric program series. Ann Arbor: Mathesis Press.
Lingoes, J. C. (1977).Geometric representations of relational data: Readings in multidimensional scaling. Ann Arbor: Mathesis Press.
Mardia, K. V., Kent, J. T., & Bibby, J. M., (1979).Multivariate Analysis. London: Academic Press.
Masson, M. (1974). Processus linéaires et analyse des données non linéaires [Linear processes and non-linear data analysis.] unpublished doctoral dissertation, Université Pierre et Marie Curie, Paris.
Masson, M. (1980).Méthodologies générales de traitement statistique de l'information de masse. [General methodologies for the statistical treatment of large information]. Paris: Cedic-Fernand Nathan.
McDonald, R. P. (1968). A unified treatment of the weighting problem.Psychometrika, 33, 351–381.
McKeon, J. J. (1966). Canonical analysis: Some relations between canonical correlation, factor analysis, discriminant function analysis and scaling theory. [Monograph No. 13].Psychometrika.
Mosier, C. I. (1946). Machine methods in scaling by reciprocal averages.Proceedings of Research Forum (pp. 35–39). New York: IBM Corporation.
Mosteller, F. (1949). A theory of scalogram analysis using noncumulative types of items. (Report No. 9). Cambridge: Harvard University, Laboratory of Social Relations.
Nishisato, S. (1972).Optimal scaling and its generalizations. I (Methods. Measurement and evaluation of Categorical Data Technical Report No. 1) Toronto: Department of Measurement and Evaluation, the Ontario Institute for Studies in Education.
Nishisato, S. (1973).Optimal scaling and its generalizations. II (Applications. Measurement and Evaluation of Categorical Data Technical Report No. 2). Toronto: Department of Measurement and Evaluation, the Ontario Institute for Studies in Education.
Nishisato, S. (1976).Optimal scaling as applied to different forms of data (Measurement and Evaluation of Categorical Data Technical Report No. 4). Toronto: Department of Measurement and Evaluation, the Ontario Institute for Studies in Education.
Nishisato, S. (1978).Multidimensional Scaling: A historical sketch and bibliography (Tech. Rep.). Toronto: Department of Measurement, Evaluation and Computer Applications, the Ontario Institute for Studies in Education.
Nishisato, S. (1979). Dual Scaling and its variants.New Directions for Testing and Measurement, 4, 1–12.
Nishisato, S. (1980).Analysis of categorical data: Dual Scaling and its applications. Toronto: University of Toronto Press.
Nishisato, S. (1982).Shitsuteki Data no Suryoka: Sotsui Shakudoho to sono Oyo. Tokyo: Asakura Shoten.
Nishisato, S., & Inukai, Y. (1972). Partially optimal scaling of items with ordered categories.Japanese Psychological Research, 14, 109–119.
Nishisato, S., & Leong, K. S. (1975).OPSCAL: A FORTRAN IV Program for analysis of qualitative data by optimal scaling (Measurement and Evaluation of Categorical Data Technical Report No. 3). Toronto: Department of Measurement and Evaluation, The Ontario Institute for Studies in Education.
Nishisato, S., Sheu, W. J. (1980). Piecewise method of reciprocal averages for dual scaling of multiple-choice data.Psychometrika, 45, 467–478.
Rao, C. R. (1964). The use and interpretation of principal component analysis in applied research.Sankhya, series A,26, 329–358.
Richardson, M., & Kuder, G. F. (1933). Making a rating scale that measures.Personnel Journal, 12, 36–40.
Saito, T. (1973). Quantification of categorical data by using the generalized variance.Soken Kiyo, Nippon UNIVAC Sogo Kenkyu-sho, Inc., 61–80.
Saporta, G. (1975). Liaison entre plusieurs ensembles de variables et codage de données qualitatives [Relationship between several sets of variables and scaling of qualitative data]. Unpublished doctoral dissertation, Université Pierre et Marie Curie, Paris.
Saporta, G. (1980, June).About some remarkable properties of generalized canonical analysis. Paper presented at the second European meeting of the Psychometric Society, Groningen, The Netherlands.
Shiba, S. (1965). A method for scoring multicategory items.Japanese Psychological Research, 7, 75–79.
Tenenhaus, M. (1977). Analyse en composantes principales d'un ensemble de variables nominales ou numériques [Principal component analysis of a set of nominal or numerical variables].Revue de Statistique Appliquée, 25, 39–56.
Torgerson, W. S. (1958).Theory and methods of scaling. New York: Wiley.
Van Rijckevorsel, J., & de Leeuw, J. (1978).An outline to HOMALS-1. Leiden: University of Leiden.
Van Rijckevorsel, J., & de Leeuw, J. (1979).An outline to PRINCALS. Leiden: University of Leiden.
Author information
Authors and Affiliations
Additional information
The ideas in this paper were worked out by the first author, with some suggestions provided by the second. The current version of this paper has evolved from three previous versions, the first two written by the first author.
Rights and permissions
About this article
Cite this article
Tenenhaus, M., Young, F.W. An analysis and synthesis of multiple correspondence analysis, optimal scaling, dual scaling, homogeneity analysis and other methods for quantifying categorical multivariate data. Psychometrika 50, 91–119 (1985). https://doi.org/10.1007/BF02294151
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02294151