Application-oriented robotic vision – a review

R. A. Jarvis

doi:10.1017/S026357470000878X

Application-oriented robotic vision – a review

Published online by Cambridge University Press: 09 March 2009

R. A. Jarvis

Show author details

R. A. Jarvis: Affiliation:
Department of Computer Science, Australian National UniversityG.P.O. Box 4 Canberra A.C.T. 2600 (Australia)

Article contents

Summary
References

Get access

Rights & Permissions

Summary

Computer Vision is essentially concerned with emulating the process of seeing, naturally manifested in various higher biological systems,1–4 on a computational apparatus, and is consequently part of the Artificial Intelligence field within the sub-category of Machine perception. Seeing has to do with making sense of image data acquired through an optical system and subsequently dealt with at increasing levels of abstraction and association with known facts about the world. The spectrum of interest in Computer Vision ranges from attempting to answer basic questions concerning the functionality of biological vision systems, particularly human, at one end, all the way to enhancing the reliability, speed and cost effectiveness of specific industrial operations, particularly component inspection and vision driven robotic manipulation. The main bulk of interest is in the middle, where the quest for generality pushes interest towards biological vision systems with their demonstrated effectiveness in a wide range of environments, some hostile, whilst the need for economic viability and timeliness in relation to particular application pushes interest towards finding workable algorithms which function reliably at high speed on affordable apparatus.

This paper is addressed, in somewhat tutorial style, at clarifying, by examples of work in the area, the issues surrounding application oriented robotic vision systems, their assumptions, strengths, weaknesses and degree of generality, and at the same time putting them in the context of the overall field of Computer Vision. In addition, the paper points to directions of development which promise to provide powerful industrial vision tools at an acceptable price.

Information

Type: Article
Information: Robotica , Volume 2 , Issue 1 , January 1984 , pp. 3 - 15

DOI: https://doi.org/10.1017/S026357470000878X [Opens in a new window]
Copyright: Copyright © Cambridge University Press 1984

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

1.Hubel, D.H. and Wiesel, T.N., “Receptive Fields and Functional Architecture of Monkey Striate Cortex” J. Physiol. (Lond.) 195, 215–243 (1968) Presents some of the pioneering studies of the behaviour of cells in the visual cortex.CrossRef Google Scholar PubMed

2.Gregory, R.L., The Intelligent Eye (McGraw-Hill, New York, 1970).Google Scholar

3.Gibson, J.J., The Senses Considered as Perceptual Systems (Houghton-Mifflin, Boston, 1966).Google Scholar

4.Richter, K. and Ullman, S., “A Model for the Spatiotemporal Organization of X and Y-type Ganglion Cells in the Primate Retina” MIT AI Memo 573.Google Scholar

5.Muerle, J.L. and Allen, D.C., “Experimental Evaluation of Techniques for Automatic Segmentation of Objects in a Complex Scene” In: Pictorial Pattern Recognition (Thompson Book Company) (edited by Cheng, , Ledley, , Pollack, & Rosenfeld, ) (Thompson Book Co., Washington, D.C., 1968).Google Scholar

6.Brice, C.R. and Fennema, C.L., “Scene Analysis using Regions” Artificial Intelligence 1, 205–226 (1970).CrossRef Google Scholar

7.Jarvis, R.A., “Region Based Image Segmentation Using Shared Near Neighbor Clustering” Proc. Int'l. Conf. on Cyber. & Soc. (09 19–21, 1977, Washington, D.C.).Google Scholar

8.Yakimovsky, Y. and Feldman, J.A., “A Semantics-Based Decision Theoretic Region Analyzer” Proc. 3rd Joint Int'l. Conf. on Artificial Intell. (08 1973, Stanford Univ.) pp. 580–588.Google Scholar

9.Horowitz, S.L. and Pavlidis, T., “Picture Segmentation by a Directed Split-and-Merge Procedure” Proc. 2nd Int'l. Joint Conf. on Pat. Recog. (08 13–15, 1974, Copenhagen) pp. 424–433.Google Scholar

10.Tomita, F., Yachida, M. and Tsuji, S., “Detection of Homogeneous Regions by Structural Analysis” Proc. IJCAI-3 (Stanford, CA, 08 1973) pp. 564–571.Google Scholar

11.Jarvis, R.A., “Image Segmentation by Interactively Combining Line, Region and Semantic Structure” Proc. Comp. Graph., Pat. Recog. & Data Struc. Conf. (05 14–16, 1975, Beverley Hills, L.A., CA) pp. 278–288.Google Scholar

12.Tenenbaum, J.M. and Weyl, S., “A Region-Analysis Subsystem for Interactive Scene Analysis” Proc. 4th Int'l Joint Conf. on A.I., (1975) p. 682.Google Scholar

13.Rosenfeld, A. and Thurston, M. “Edge and Curve Detection for Visual Scene Analysis” IEEE Trans. on Comp. C-20, No. 5, pp 567–569 (05 1971).Google Scholar

14.Roberts, L., “Machine Perception of Three-Dimensional Solids” Lincoln Laboratory Technical Report N-15 (MIT, 05 1963).Google Scholar

15.Yoda, H., Motoike, J. and Ejiri, M., “Direction Coding Method and Its Application to Scene Analysis” Proc. 4th Int'l Joint Conf. on A.I. (1975) p. 620.Google Scholar

16.Rosenfeld, A. and Thurston, M., “Edge and Curve Detection: Further Experiments” IEEE Trans. on Comp. C-21, No. 7, 677–715 (07 1972).CrossRef Google Scholar

17.Hueckel, M., “A Local Visual Operator which Recognizes Edges and Lines” Journal of the ACM 20, No. 4, 634–647 (10 1973).CrossRef Google Scholar

18.Hueckel, M., “An Operator which Locates Edges in Digitized Pictures” Journal of the ACM 18, No. 1, 113–125 (01 1971).CrossRef Google Scholar

19.Shirai, Y., “Edge Finding, Segmentation of Edges and Recognition of Complex Objects” Proc. 4th Int'l Joint Conf. on A.I. (1975) p. 674.Google Scholar

20.Duda, R.O. and Hart, P.E., Pattern Classification and Scene Analysis (Wiley, New York, 1973).Google Scholar

21.Nagy, G., “State of the Art in Pattern Recognition” Proc. IEEE 56, No. 5, 836–863 (05 1968).CrossRef Google Scholar

22.Patrick, E.A., Fundamentals of Pattern Recognition (Prentice-Hall, N.J., 1972).Google Scholar

23.Watanabe, S. (Editor), Methodologies of Pattern Recognition (Academic Press, New York, London, 1969).Google Scholar

24.Watanabe, S. (Editor), Frontiers of Pattern Recognition (Academic Press, New York, London, 1972).Google Scholar

25.Kailath, T., “A General Likelihood - Ratio Formula for Random Signals in Gaussian Noise” IEEE Trans. Information Theory IT-15, 350–361 (05 1980).CrossRef Google Scholar

26.Unger, S.H., “Pattern Detection and Recognition” Proc. IRE (10 1959) pp. 1737–1752.CrossRef Google Scholar

27.Agin, G.J. & Duda, R.O., “SRI Vision Research for Advanced Industrial Application” Proc. 2nd USA-Japan Computer Conference (1975) pp. 113–117.Google Scholar

28.Cover, T.M. and Hart, P.E., “Nearest Neighbor Pattern Classification” IEEE Trans. Info. Theory IT-13, 21–27 (01 1967).CrossRef Google Scholar

29.Patrick, E.A. and Fischer, F.P. III, “A Generalized K- Nearest Neighbor Rule”, Information and Control 16, 128–152 (1970).CrossRef Google Scholar

30.Fu, K-S., Syntactic Methods in Methods in Pattern Recognition (Academic Press, New York, London, 1974).Google Scholar

31.Thomson, M.G., “Finite Fuzzy Automata, Regular Fuzzy Languages and Pattern Recognition” Pattern Recognition 5, 383–390 (1973).CrossRef Google Scholar

32.Zadeh, L.A., Fu, K-S., Tanaka, K. and Shimura, M., (Editors), Fuzzy Sets and Their Applications to Cognitive and Decision Processes (Academic Press, New York, San Francisco, London, 1975).Google Scholar

33.Jarvis, R.A. and Patrick, E.A., “Clustering Using a Similarity Measure Based on Shared Near-Neighbors” IEEE Trans. Computers 22, 1025, (11 1973).CrossRef Google Scholar

34.Zahn, C.T., “Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters” IEEE Trans. Computers C-20, 68–86 (01 1971).CrossRef Google Scholar

35.Anderberg, M.R., Cluster Analysis for Applications (Academic Press, NY, 1973).Google Scholar

36.Ball, G.H. and Hall, D.J., “ISODATA, An Iterative Method of Multivariate Analysis and Pattern Classification”, presented at the Inst. Commun. Conf., Philadelphia, PA (1966).Google Scholar

37.Hartigan, J.A., Clustering Algorithms (John Wiley and Sons, Chichester 1975), Chapter 11, pp. 191–214.Google Scholar

38.Diday, E. and Simon, J.C., “Clustering Analysis” In: Digital Pattern Recognition (Edited by Fu, K-S.) (Springer-Verlag, Berlin, 1976).Google Scholar

39.Sacerdoti, E.D., “The Nonlinear Nature of Plans” SRI, Artificial Intelligence Group, Technical Note 101 (01, 1975).Google Scholar

40.Nilsson, N.J., “Mechanisms for Goal Seeking, Planning and Reasoning” SRI, Artificial Intelligence Group, Technical Note 130 (05 1976).Google Scholar

41.Lozano-Perez, L. and Wesley, M.A., “An Algorithm for Planning Collision-Free Paths Among Polyhedral Obstacles” Commun. ACM 22, No. 10, 560–570 (10, 1979).CrossRef Google Scholar

42.Udupa, S.M., “Collision Detection and Avoidance in Computer Controlled Manipulators” Proc. 5th Int'l Joint Conf. on A.I. (08 22–25, M.I.T., 1977) pp. 737–748.Google Scholar

43.Gleason, G.J. and Algin, G.J., “A Modular Vision System for Sensor-Controlled Manipulation and Inspection” Proc. of 9th International Symposium on Industrial Robots, Washington, D.C. (03, 1979) pp. 57–70.Google Scholar

44.Kinnucan, P., “How Smart Robots are Becoming Smarter” High Technology, 1, No. 1, 42–40 (09/10, 1981).Google Scholar

45.Reinhold, A.G. and Vanderbrug, G., “Robot Vision for Industry: The Autovision System” Robotics Age 2, No. 3, 22–28 (Fall, 1980).Google Scholar

46.Rosenfeld, A. and Pfaltz, J.L., “Sequential Operations in Digital Picture Processing” J. ACM 13, No. 4, 471–494 (10, 1966).Google Scholar

47.Holland, S.W., Rossol, L. & Ward, M.R., “Consight-I: A Vision-Controlled Robot System for Transferring Parts from Belt Conveyors” In: Computer Vision and Sensor-Based Robotics (edited by Dodd, G.G. and Rossol, L.), (Plenum Press, 1979) pp. 81–100.CrossRef Google Scholar

48.Bolles, R.C. and Cain, R.A., “Recognising and Locating Partially Visible Objects: The Local-Feature-Focus Method”, Intern. J. Robot. Res. 1, No. 3, 57–82 (Fall, 1982).CrossRef Google Scholar

49.Gonzalez, R.C. and Wintz, P., Digital Image Processing (Addison-Wesley, London, 1977).Google Scholar

50.Kaneff, S. (Editor), Picture Language Machines (Academic Press, London, New York, 1970).Google Scholar

51.Haralick, R.M. and Simon, J.C. (Editors), Issues in Digital Image Processing (Sijthoff and Noordhoff, 1980).CrossRef Google Scholar

52.Pratt, W.K., Digital Image Processing (John Wiley and Sons, Chichester, 1978).Google Scholar

53.Rosenfeld, A. (Editor), Image Modelling (Academic Press, New York, 1981).Google Scholar

54.Stucki, P. (Editor), Advances in Digital Image Processing: Theory, Application, Implementation (Plenum Press, New York, 1979).CrossRef Google Scholar

55.Nake, F. and Rosenfeld, A. (Editors), Graphic Languages (North-Holland, Amsterdam, 1972).Google Scholar

56.Grasselli, A. (Editor), Automatic Interpretation and Classification of Images (Academic Press, New York, 1969).Google Scholar

57.Rosenfeld, A., Picture Processing by Computer (Academic Press, New York, 1969).CrossRef Google Scholar

58.Rosenfeld, A.and Kak, A.C., Digital Picture Processing (Academic Press, New York, 1976).Google Scholar

59.Hall, E.L., Computer Image Processing and Recognition, (Academic Press, New York, 1979).Google Scholar

60.Klinger, A., Fu, K-S. and Kunii, T.L. (Editors), Data Structures, Computer Graphics and Pattern Recognition (Academic Press, New York, 1977).Google Scholar

61.Huang, T.S. (Editor), Image Sequence Analysis (Springer-Verlag, Berlin, 1981).CrossRef Google Scholar

62.Aggarwal, J.K., Duda, R.O. and Rosenfeld, A. (Editors), Computer Methods in Image Analysis (IEEE Press, New York, 1977).Google Scholar

63.Perkins, W.A., “A Model-Based Vision System for Industrial Parts” IEEE Trans on Computers C27, No. 2, 126–143 (02, 1978).CrossRef Google Scholar

64.Kelly, R., Birk, J., Dessimoz, J., Martins, H. and Tella, R., “Acquiring Connecting Rod Castings Using a Robot with Vision and Sensors” Proc. 1st Int'l Conf. on Robot Vision and Sensory Controls, Stratford-upon-Avon, U.K. (04 1–3, 1981) pp. 169–178.Google Scholar

65.Guzman, A., “Computer Recognition of Three-Dimensional Objects in a Visual Scene” Ph.D. Thesis, MAC-TR-59, Project MAC, M.I.T. Mass (1968).Google Scholar

66.Waltz, D., “Generating Semantic Descriptions from Drawings of Scenes with Shadows” Ph.D. Thesis, M.I.T., Mass. (1972);Google Scholar

also Ch. 2 of The Psychology of Computer Vision (Winston, P.H., Ed.) (McGraw-Hill, New York, 1975).Google Scholar

67.Winston, P.H., “Learning Structural Descriptions from Examples”, Ch. 5 of The Psychology of Computer Vision (Winston, P.H., Ed.) (McGraw-Hill, New York, 1975).Google Scholar

68.Duda, R.O., “Some Current Techniques for Scene Analysis” Stanford Research Centre, A.I. Centre, Technical Note 46 (10 1970).Google Scholar

69.Nilsson, N.J., Problem-Solving Methods in Artificial Intelligence, (McGraw-Hill, New York, 1971).Google Scholar

70.Jarvis, R.A., “A Perspective on Rangefinding Techniques for Computer Vision” IEEE Trans. on Pattern Analysis and Machine Intelligence PAMI-5, No. 2, 122–139 (01 1983).CrossRef Google Scholar

71.Popplestone, R.J., Brown, C.M., Ambler, A.P. and Crawford, G.F., “Forming Models of Plane-and-Cylinder Faceted Bodies from Light Stripes” Proc. 4th Int'l. Joint Conf. A.I. (1975) pp. 664–668.Google Scholar

72.Crawford, G.F., “The Stripe Finder Hardware” (Dep. Artificial Intell., Univ. Edinburgh, 1974).Google Scholar

73.Rocker, F. and Kiessling, A., “Methods for Analysing Three Dimensional Scenes” Proc. 4th Int'l Joint Conf. A.I. (1975) pp. 669–673.Google Scholar

74.Will, P.M. and Pennington, N.S., “Grid Coding: A Preprocessing Technique for Robot and Machine Vision” Proc. 2nd Int'l Joint Conf. A.I. (09 1971) pp. 66–68.CrossRef Google Scholar

75.Rosenberg, D., Levine, M.D. and Zucker, S.W., “Computing Relative Depth Relationships from Occlusion Cues” Proc. 4th Int'l Joint Conf. Pattern Recognition, Kyoto, Japan (11, 7–10, 1978) pp. 765–769.Google Scholar

76.Marr, D. and Poggio, T., “Cooperative Computation of Stereo Disparity” Artificial Intelligence Memo 364, MIT A.I. Lab., Cambridge, Mass. (06 1976).CrossRef Google Scholar

77.Bajcsy, R. and Lieberman, L., “Texture Gradient as a Depth Cue” Computer Graphics and Image Processing 5, 52–67 (1976).CrossRef Google Scholar

78.Jarvis, R.A., “Focus Optimisation Criteria for Computer Image Processing” The Microscope 24, 162–179 (2nd., Quarter, 1976).Google Scholar

79.Nitzan, D., Brain, A.E. and Duda, R.O., “The Measurement and Use of Registered Reflectance and Range Data in Scene Analysis” Proc. IEEE, 65, No. 2, 206–220 (02, 1977).CrossRef Google Scholar

80.Lewis, R.A. and Johnston, A.R., “A Scanning Laser Rangefinder for a Robotic Vehicle”, Proc. 5th Int'l Joint Conf. A.I. (1977) pp. 762–768.Google Scholar

81.Will, P.M. and Pennington, K.S., “Grid Coding: A Preprocessing Technique for Robot and Machine Vision”, Proc. 2nd Joint Int'l Conf. A.I. (09 1971) pp. 66–70.CrossRef Google Scholar

82.Jarvis, R.A., “A Laser Time-of-Flight Range Scanner for Robotic Vision”, Dept. of Computer Science, Australian National University, Technical Report TR-CS–81–10. Also to appear in Trans IEEE on PAMI.Google Scholar

83.Shirai, Y. and Suwa, M., “Recognition of Polyhedrons with a Range Finder” Proc. 2nd Int'l Joint Conf. A.I. London (09 1971) pp. 80–87.Google Scholar

84.Agin, G.J. and Binford, T.O., “Computer Description of Curved Objects” Proc. Int'l Joint Conf. A.I., Stanford Univ. (08 20–23, 1973) pp. 629–640.Google Scholar

85.Yakimovsky, Y. and Cunningham, R., “A System for Extracting Three-Dimensional Measurements from a Stereo Pair of TV Cameras” Comput. Graphics Image Processing 7, 195–210 (1978).CrossRef Google Scholar

86.Moravec, H.P., “Visual Mapping by a Robot Rover” Proc. 6th Int'l Joint Conf. A.I. (1979) pp. 598–620.Google Scholar

87.Baker, H.H., “Edge Based Stereo Correlation” Proc. ARPA Image Understanding Workshop Univ. Maryland (04, 1980).Google Scholar

88.Julesz, B., “Binocular Depth Perception Without Familiarity Cues” Science 145, pp. 356–362 (1964).Google Scholar

89.Marr, D. and Hildreth, E.C., “Theory of Edge Detection” Proc. R. Soc. London B 207, 187–217 (1980).Google Scholar PubMed

90.Nishihara, H.K. and Larson, N.G., “Towards Real-Time Implementation of the Marr-Poggio Stereo Matcher” Proc. Image Understanding Workshop (edited by Lee, Baumann) (1981).Google Scholar

91.Jarvis, R.A., “Expedient Range Enhanced 3-D Robot Colour Vision” Robotica 1, 25–31 (1983).CrossRef Google Scholar

92.Marr, D., “Representing Visual Information - A Computational Approach” Computer Vision Systems (Hanson, A.R. and Riseman, E.M., Editors) (Academic Press, New York, 1978) pp. 61–80.Google Scholar

93.Page, C.J. and Hassan, H., “Non-Contact Inspection of Complex Components Using a Rangefinder Vision System” Proc. 1st Int'l Conf. on Robot Vision and Sensory Controls, Stratford-Upon-Avon, U.K. (1981) pp. 245–254.Google Scholar

94.Jarvis, R.A., “Robotic Vision Using 3D Space Cube Solid Modelling Derived from Multiple Image Projections”, submitted to 7th Australian Computer Science Conference, Adelaide (02 1984) pp. 18–1 to 18–11.Google Scholar

95.Loughlin, C. and Hudson, E., “Eye in Hand Robot Vision”, Proc. 2nd Int'l Conf. on Robot Vision and Sensory Controls, Stuttgart, Germany (11 24, 1982) pp. 263–270.Google Scholar

96.Agin, G.J., “Servoing with Visual Feedback”, S.R.I., Artificial Intelligence Centre, Technical Note 149 (07 1977).Google Scholar

97.High Technology 3, No. 4, 37 (04 1983).Google Scholar

Article contents

Application-oriented robotic vision – a review

Summary

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests