Research in political science has begun to explore how to use large language and object detection models to analyze text and visual data. However, few studies have explored how to use these tools for data extraction. Instead, researchers interested in extracting text from poorly formatted sources typically rely on optical character recognition and regular expressions or extract each item by hand. This letter describes a workflow for structured text extraction using free models and software. I discuss the type of data best suited to this method, its usefulness within political science, and the steps required to convert the text into a usable dataset. Finally, I demonstrate the method by extracting agenda items from city council meeting minutes. I find the method can accurately extract subsections of text from a document and requires only a few hand-labeled documents to train adequately.
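The extraction step described above can be sketched roughly as follows. The prompt wording, the JSON agenda-item schema, and the helper names are illustrative assumptions, not the letter's actual implementation:

```python
import json

def build_extraction_prompt(minutes_text: str) -> str:
    """Assemble an instruction prompt asking a language model to return
    agenda items as JSON. The schema here is illustrative only."""
    return (
        "Extract every agenda item from the city council minutes below.\n"
        "Respond with a JSON list of objects with keys 'number' and 'title'.\n\n"
        + minutes_text
    )

def parse_agenda_items(model_response: str) -> list:
    """Parse the model's JSON response into a list of agenda items,
    tolerating surrounding prose by slicing to the outermost brackets."""
    start, end = model_response.find("["), model_response.rfind("]") + 1
    return json.loads(model_response[start:end])

# Example with a hand-written stand-in for a model response:
response = 'Here are the items: [{"number": 1, "title": "Budget hearing"}]'
items = parse_agenda_items(response)
```

In a real workflow, `response` would come from whichever free model the researcher chooses, and the parsed items would be appended to a tabular dataset.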
Automatic precision herbicide application offers significant potential for reducing herbicide use in turfgrass weed management. However, developing accurate and reliable neural network models is crucial for achieving optimal precision weed control. The reported neural network models in previous research have been limited by specific geographic regions, weed species, and turfgrass management practices, restricting their broader applicability. The objective of this research was to evaluate the feasibility of deploying a single, robust model for weed classification across a diverse range of weed species, considering variations in species, ecotypes, densities, and growth stages in bermudagrass turfgrass systems across different regions in both China and the United States. Among the models tested, ResNeXt152 emerged as the top performer, demonstrating strong weed detection capabilities across 24 geographic locations and effectively identifying 14 weed species under varied conditions. Notably, the ResNeXt152 model achieved an F1 score and recall exceeding 0.99 across multiple testing scenarios, with a Matthews correlation coefficient (MCC) value surpassing 0.98, indicating its high effectiveness and reliability. These findings suggest that a single neural network model can reliably detect a wide range of weed species in diverse turf regimes, significantly reducing the costs associated with model training and confirming the feasibility of using one model for precision weed control across different turf settings and broad geographic regions.
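As a rough illustration of the metrics reported above, the F1 score and Matthews correlation coefficient (MCC) can be computed from binary confusion-matrix counts; the counts below are hypothetical, chosen only to produce scores in the reported range:

```python
import math

def f1_and_mcc(tp: int, tn: int, fp: int, fn: int):
    """Compute the F1 score and Matthews correlation coefficient (MCC)
    from binary confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return f1, mcc

# A detector that makes very few errors yields scores near 1.0:
f1, mcc = f1_and_mcc(tp=990, tn=980, fp=10, fn=10)
```

Unlike F1, the MCC uses all four cells of the confusion matrix, which is why it is often reported alongside F1 as a check against class imbalance.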
Weed diversity plays an important role in the functioning of agroecosystems. Moreover, a number of endangered/threatened plant species occur as weeds in arable fields and/or field boundaries. Agricultural intensification has imposed negative consequences on weed diversity in general, and the survival of the endangered/threatened plant species in particular. The objective of this review is to provide a theoretical framework for promoting cropland weed diversity through precision agriculture. A systematic review was conducted based on literature analysis, existing knowledge gaps, and current needs to identify a suitable approach for promoting cropland biodiversity while protecting crop yields. While nonchemical weed management methods and economic threshold–based approaches are touted to improve weed diversity, they are either ineffective or insufficient for this purpose; long-term economic consequences and the risk of weed adaptation are major concerns. A plant functional trait-based approach to promoting weed diversity, one that considers a plant’s ecosystem service potential and competitiveness with the crop, among other factors, has been proposed by researchers. This approach has tremendous potential for weed diversity conservation in commercial production systems, but field implementation has been limited thus far due to our inability to selectively control weeds at the individual-plant level. However, recent advancements in computer vision, machine learning, and site-specific weed management technologies may allow for the accurate elimination of unwanted plants while retaining the important ones. Here, we present a novel framework for the utilization of precision agriculture for the conservation of cropland weed diversity, including the protection of endangered/threatened plant species, while protecting crop yields. 
This approach is the first of its kind in which the control priority is ranked on an individual-plant basis, by integrating intrinsic weed trait values with field infestation characteristics, while management thresholds are tailored to specific goals and priorities.
Posture-related musculoskeletal issues among office workers are a significant health concern, mainly due to long periods spent in static positions. This research presents the Posture Lab, a workplace-based, easy-to-use posture monitoring system that allows employees to assess their posture. The Posture Lab focuses on two key aspects: Normal Head Posture (NHP) versus Forward Head Posture (FHP) measurement, and thoracic spine kyphosis. The Craniovertebral Angle (CA) and Shoulder Angle (SA) quantify NHP and FHP, while the Kyphosis Angle (KA) measures normal thoracic spine alignment and kyphosis. To measure these angles, the system uses computer vision with ArUco marker detection via a webcam to analyze head positions. Additionally, wearable accelerometer sensors measure kyphosis from angles of inclination. The framework includes a web-based user interface for registration and specialized desktop applications for different measurement protocols. A RESTful API enables system communication and centralized data storage for reporting. The Posture Lab serves as an effective tool for organizations to evaluate employee postures and supports early intervention strategies, allowing timely referrals to healthcare providers if potential musculoskeletal issues are identified. The Posture Lab has also shown medium to very high correlations with a standard 2D motion analysis method, Kinovea, for CA, SA, and KA in FHP with kyphosis measurements (r = 0.607, 0.704, and 0.992), and high to very high correlations in NHP with normal thoracic spine measurements (r = 0.809, 0.748, and 0.778), with significance at p < .01, using the Pearson correlation coefficient.
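The abstract does not give the angle formulas, but a craniovertebral angle computed from two detected marker positions might be sketched as follows; the marker placement (C7 vertebra and tragus of the ear) and the y-up coordinate convention are assumptions, not the system's documented protocol:

```python
import math

def craniovertebral_angle(c7, tragus):
    """Angle in degrees between the horizontal through the C7 marker and
    the line from C7 to the tragus marker; coordinates are (x, y) with y
    pointing up. Smaller angles indicate a more forward head posture."""
    dx = tragus[0] - c7[0]
    dy = tragus[1] - c7[1]
    return math.degrees(math.atan2(dy, dx))

# With the tragus 10 units forward and 10 units above C7, the angle is 45 degrees:
angle = craniovertebral_angle(c7=(0.0, 0.0), tragus=(10.0, 10.0))
```

In the actual system the two positions would come from ArUco marker detections in the webcam frame rather than hand-entered coordinates.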
The art of image restoration and completion has entered a new phase thanks to digital technology. Indeed, virtual restoration is sometimes the only feasible option available to us, and it has, under the name 'inpainting', grown from methods developed in the mathematics and computer vision communities into tools used routinely by conservators and historians working in the worlds of fine art and cinema. The aim of this book is to provide, for a broad audience, a thorough description of image inpainting techniques. The book has a two-layer structure. In one layer, there is a general and more conceptual description of inpainting; in the other, there are boxed descriptions of the essentials of the mathematical and computational details. The idea is that readers can easily skip those boxes without disrupting the narrative. Examples of how the tools can be used are drawn from the Fitzwilliam Museum, Cambridge, collections.
Comprehensive housing stock information is crucial for informing the development of climate resilience strategies aiming to reduce the adverse impacts of extreme climate hazards in high-risk regions like the Caribbean. In this study, we propose an end-to-end workflow for rapidly generating critical baseline exposure data using very high-resolution drone imagery and deep learning techniques. Specifically, our work leverages the segment anything model (SAM) and convolutional neural networks (CNNs) to automate the generation of building footprints and roof classification maps. We evaluate the cross-country generalizability of the CNN models to determine how well models trained in one geographical context can be adapted to another. Finally, we discuss our initiatives for training and upskilling government staff, community mappers, and disaster responders in the use of geospatial technologies. Our work emphasizes the importance of local capacity building in the adoption of AI and Earth Observation for climate resilience in the Caribbean.
Environmental enrichment programmes are widely used to improve the welfare of captive and laboratory animals, especially non-human primates. Monitoring enrichment use over time is crucial, as animals may habituate and reduce their interaction with it. In this study we aimed to monitor the interaction with enrichment items in groups of rhesus macaques (Macaca mulatta), each consisting of an average of ten individuals, living in a breeding colony. To streamline the time-intensive task of assessing enrichment programmes, we automated the evaluation process using machine learning technologies. We built two computer vision-based pipelines to evaluate monkeys’ interactions with different enrichment items: a white drum containing raisins and a non-food-based puzzle. The first pipeline analyses the usage of enrichment items in nine groups, both when the drum contains food and when it is empty. The second pipeline counts the number of monkeys interacting with a puzzle across twelve groups. The data derived from the two pipelines reveal that the macaques consistently express interest in the food-based white drum enrichment, even several months after its introduction. The puzzle enrichment was monitored for one month, showing a gradual decline in interaction over time. These pipelines are valuable for assessing enrichment by minimising the time spent on animal observation and data analysis; this study demonstrates that automated methods can consistently monitor macaque engagement with enrichments, systematically tracking habituation responses and long-term effectiveness. Such advancements have significant implications for enhancing animal welfare, enabling the discontinuation of ineffective enrichments and the adaptation of enrichment plans to meet the animals’ needs.
Treating images as data has become increasingly popular in political science. While existing classifiers for images reach high levels of accuracy, it is difficult to systematically assess the visual features on which they base their classification. This paper presents a two-level classification method that addresses this transparency problem. At the first stage, an image segmenter detects the objects present in the image and a feature vector is created from those objects. In the second stage, this feature vector is used as input for standard machine learning classifiers to discriminate between images. We apply this method to a new dataset of more than 140,000 images to detect which ones display political protest. This analysis demonstrates three advantages of this paper’s approach. First, identifying objects in images improves transparency by providing human-understandable labels for the objects shown in an image. Second, knowing these objects enables analysis of which objects distinguish protest images from non-protest ones. Third, comparing the importance of objects across countries reveals how protest behavior varies. These insights are not available using conventional computer vision classifiers and provide new opportunities for comparative research.
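The first-stage feature vector might be built along these lines; the object vocabulary below is illustrative, not the paper's actual segmenter classes:

```python
from collections import Counter

# A fixed object vocabulary; these labels are illustrative stand-ins for
# whatever classes the image segmenter emits.
VOCABULARY = ["person", "flag", "sign", "car", "fire"]

def objects_to_features(detected):
    """Turn a list of detected object labels into a count vector over a
    fixed vocabulary, yielding a human-interpretable feature vector that
    any standard classifier (logistic regression, random forest, ...)
    can consume."""
    counts = Counter(detected)
    return [counts[label] for label in VOCABULARY]

# An image whose segmenter output is three people, a sign, and a flag:
features = objects_to_features(["person", "person", "sign", "person", "flag"])
```

Because each feature is the count of a named object, a classifier's learned weights can be read directly as "which objects matter", which is the transparency gain the paper emphasizes.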
Fast and efficient identification is critical for reducing the likelihood of weed establishment and for appropriately managing established weeds. Traditional identification tools require either knowledge of technical morphological terminology or time-consuming image matching by the user. In recent years, deep learning computer vision models have become mature enough to enable automatic identification. The major remaining bottlenecks are the availability of a sufficient number of high-quality, reliably identified training images and the user-friendly, mobile operationalization of the technology. Here, we present the first weed identification and reporting app and website for all of Australia. It includes an image classification model covering more than 400 species of weeds and some Australian native relatives, with a focus on emerging biosecurity threats and spreading weeds that can still be eradicated or contained. It links the user to additional information provided by state and territory governments, flags species that are locally reportable or notifiable, and allows the creation of observation records in a central database. State and local weed officers can create notification profiles to be alerted of relevant weed observations in their area. We discuss the background of the WeedScan project, the approach taken in design and software development, the photo library used for training the WeedScan image classifier, the model itself and its accuracy, and technical challenges and how these were overcome.
Visual odometry (VO) is a key technology for estimating camera motion from captured images. In this paper, we propose a novel RGB-D visual odometry method that constructs and matches features at the superpixel level and adapts better to different environments than state-of-the-art solutions. Superpixels are content-sensitive and perform well in information aggregation, and can thus characterize the complexity of the environment. First, we design the superpixel-based feature SegPatch and its corresponding 3D representation MapPatch. By using neighboring information, SegPatch remains robustly distinctive in various environments with different texture densities, and because it incorporates depth measurements, the MapPatch represents the scene structurally. We then define a distance between SegPatches to characterize regional similarity and use a graph search method in scale space for searching and matching, improving the accuracy and efficiency of the matching process. Finally, we minimize the reprojection error between matched SegPatches and estimate camera poses from all these correspondences. Our proposed VO is evaluated on the TUM dataset both quantitatively and qualitatively, showing a good ability to adapt to the environment under different realistic conditions.
With the rise of deep reinforcement learning (RL) methods, many complex robotic manipulation tasks are being solved. However, harnessing the full power of deep learning requires large datasets. Online RL does not lend itself readily to this paradigm due to costly and time-consuming agent-environment interaction. Therefore, many offline RL algorithms have recently been proposed to learn robotic tasks. However, most such methods focus on single-task or multitask learning, which requires retraining whenever we need to learn a new task. Continually learning tasks without forgetting previous knowledge, combined with the power of offline deep RL, would allow us to scale the number of tasks by adding them one after another. This paper investigates the effectiveness of regularisation-based methods like synaptic intelligence for sequentially learning image-based robotic manipulation tasks in an offline-RL setup. We evaluate the performance of this combined framework against common challenges of sequential learning: catastrophic forgetting and forward knowledge transfer. We performed experiments with different task combinations to analyse the effect of task ordering, and also investigated the effect of the number of object configurations and the density of robot trajectories. We found that learning tasks sequentially helps in the retention of knowledge from previous tasks, thereby reducing the time required to learn a new task. Regularisation-based approaches for continual learning, like the synaptic intelligence method, help mitigate catastrophic forgetting but have shown only limited transfer of knowledge from previous tasks.
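The synaptic intelligence method referenced above mitigates forgetting by adding a quadratic penalty that anchors parameters important to earlier tasks. A minimal sketch of that penalty, in its standard form rather than this paper's exact implementation, is:

```python
import numpy as np

def si_penalty(theta, theta_star, omega, c=0.1):
    """Synaptic-intelligence regularisation term: parameters that were
    important for earlier tasks (large omega) are penalised for drifting
    away from their previously consolidated values theta_star. The
    strength c trades off stability against plasticity."""
    return float(c * np.sum(omega * (theta - theta_star) ** 2))

# A parameter with high importance (omega = 10) drifting by 1.0 dominates
# the penalty relative to an unimportant one drifting by 2.0:
penalty = si_penalty(theta=np.array([1.0, 2.0]),
                     theta_star=np.array([0.0, 0.0]),
                     omega=np.array([10.0, 0.1]),
                     c=0.1)
```

During sequential training, this term is added to the task loss, so gradient descent is free to move unimportant weights while important ones stay near their consolidated values.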
Artificial Intelligence (AI) is reshaping the world as we know it, impacting all aspects of modern society, largely due to advances in computing power, data availability, and AI algorithms. The dairy sector is also on the move: from exponential growth in AI research to ready-to-use AI-based products, this evolution towards Dairy 4.0 represents a potential ‘game-changer’ for the dairy sector in confronting challenges regarding sustainability, welfare, and profitability. This research reflection explores the possible impact of AI, discusses the main drivers in the field, and describes its origins, challenges, and opportunities. Further, we present a multidimensional vision considering factors that are not commonly considered in dairy research, such as geopolitical aspects and legal regulations that can affect the application of AI in the dairy sector. This is just the beginning of the third wave of AI, and much of its future still lies ahead. For now, the current advances in AI at the on-farm level seem limited, and based on the reviewed data, we believe that AI can be a ‘game-changer’ only if it is integrated with other components of Dairy 4.0 (such as robotics) and is fully adopted by dairy farmers.
This chapter introduces the reader to the history of facial recognition technology (FRT) and its development from the perspective of science and technology studies. Beginning with the traditionally accepted origins of FRT in 1964–1965, developed by Woody Bledsoe, Charles Bisson, and Helen Wolf Chan in the United States, Simon Taylor discusses how FRT builds on earlier applications in mug shot profiling, imaging, biometrics, and statistical categorisation. Grounded in the history of science and technology, the chapter demonstrates how critical aspects of FRT infrastructure are aided by scientific and cultural innovations from different times and locations: that is, mugshots in nineteenth-century France; mathematical analysis of caste in nineteenth-century British India; and innovations by Chinese closed-circuit television companies and computer vision start-ups conducting bio-security experiments on farm animals. This helps to understand FRT development beyond the United States-centred narrative. The aim is to deconstruct the historical data, mathematical, and digital materials that act as ‘back-stage elements’ of FRT and are not so easily located in infrastructure, yet continue to shape uses today. Taylor’s analysis lays a foundation for the kinds of frameworks that can better help regulate and govern FRT as a means for power over populations in the following chapters.
Varietal identification plays a pivotal role in viticulture for several purposes. Nowadays, such identification is accomplished using ampelography and molecular markers, techniques requiring specific expertise and equipment. Deep learning, on the other hand, appears to be a viable and cost-effective alternative, as several recent studies claim that computer vision models can identify different vine varieties with high accuracy. Such works, however, limit their scope to a handful of selected varieties and do not provide accurate figures for external data validation. In the current study, five well-known computer vision models were applied to leaf images to verify whether the results presented in the literature can be replicated over a larger data set consisting of 27 varieties with 26 382 images. The data set was built over 2 years of dedicated field sampling at three geographically distinct sites, and a validation data set was collected from the Internet. Cross-validation results on the purpose-built data set confirm literature results. However, the same models, when validated against the independent data set, appear unable to generalize beyond the training data and do not retain the performances measured during cross validation. These results indicate that further work is needed to fill this gap and to develop a more reliable model to discriminate among grape varieties, underlining that, to achieve this purpose, image resolution appears to be a crucial factor in the development of such models.
This article designs a robotic Chinese character writing system that can resist random human interference. First, an innovative stroke extraction method for Chinese characters was devised: a basic stroke extraction method based on cumulative direction vectors extracts the components that make up the strokes, and the components are then stitched together into strokes using a sequential basic-stroke joining method. To enable the robot to imitate handwritten Chinese character skills, we used the stroke information as the demonstration and modelled the skills using dynamic movement primitives (DMPs). To suppress random human interference, this article combines improved DMPs and admittance control to adjust robot trajectories based on real-time visual measurements. The experimental results show that the proposed method can accurately extract the strokes of most Chinese characters. The designed trajectory adjustment method offers better smoothness and robustness than directly rotating and translating the curves. The robot is able to adjust its posture and trajectory in real time to eliminate the negative impacts of human interference.
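A dynamic movement primitive of the kind used to model the writing skill reduces, when the learned forcing term is set to zero, to a stable attractor toward the stroke's goal point. A minimal one-dimensional sketch of that transformation system follows; the gains and step size are illustrative, not the article's tuned values:

```python
def dmp_rollout(y0, g, steps=2000, dt=0.001, alpha=25.0):
    """Euler-integrate a one-dimensional DMP transformation system with
    the forcing term set to zero:
        z' = alpha * (beta * (g - y) - z),   y' = z
    With beta = alpha / 4 the system is critically damped and converges
    smoothly to the goal g. A learned forcing term (trained from
    demonstrated strokes) would be added to z' to shape the trajectory."""
    beta = alpha / 4.0
    y, z = y0, 0.0
    for _ in range(steps):
        z += dt * alpha * (beta * (g - y) - z)
        y += dt * z
    return y

# Starting at 0 and attracting toward g = 1, the rollout converges near 1:
final_y = dmp_rollout(y0=0.0, g=1.0)
```

The attractor structure is what makes DMPs robust to perturbation: if interference displaces the pen, re-integrating from the displaced state still converges to the goal.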
Bag manipulation by robots is complex and challenging due to the deformability of the bag. Based on a dynamic manipulation strategy, we propose a new framework, ShakingBot, for bagging tasks. ShakingBot utilizes a perception module to identify the key region of a plastic bag from arbitrary initial configurations. Based on the segmentation, ShakingBot iteratively executes a novel set of actions, including Bag Adjustment, Dual-arm Shaking, and One-arm Holding, to open the bag. The dynamic action, Dual-arm Shaking, can effectively open the bag without needing to account for the crumpled configuration. The robot then inserts the items and lifts the bag for transport. We perform our method on a dual-arm robot and achieve a success rate of 21/33 for inserting at least one item across various initial bag configurations. In this work, we demonstrate the performance of the dynamic shaking action compared to quasi-static manipulation in the bagging task. We also show that our method generalizes to variations in the bag’s size, pattern, and color. Supplementary material is available at https://github.com/zhangxiaozhier/ShakingBot.
The demand for flexible grasping of various objects by robotic hands in industry is growing rapidly. To address this, we propose a novel variable stiffness gripper (VSG). The VSG design is based on a parallel-guided beam structure into which a slider is inserted from one end, allowing stiffness variation by changing the length of the parallel beams participating in the system. This design enables continuous adjustment between high compliance and high stiffness of the gripper fingers, providing robustness through the mechanical structure. A linear analytical model of the deflection and stiffness of the parallel beam is derived, which is suitable for small and medium deflections, and the contribution of each parameter of the parallel beam to the stiffness is analyzed and discussed. A prototype of the VSG is developed, achieving a highly competitive stiffness ratio of 70.9. Moreover, a vision-based force sensing method utilizing ArUco markers is proposed as a replacement for traditional force sensors. With this method, the VSG is capable of closed-loop control during the grasping process, ensuring efficiency and safety under a well-defined grasping strategy framework. Experimental tests are conducted to emphasize the importance and safety of stiffness variation; in addition, they show the high performance of the VSG in adaptive grasping for asymmetric scenarios and its ability to flexibly grasp objects of various hardness and fragility. These findings provide new insights for future developments in the field of variable stiffness grippers.
Computer vision and machine learning are rapidly advancing fields of study. For better or worse, these tools have already permeated our everyday lives and are used for everything from auto-tagging social media images to curating what we view in our news feed. In this chapter, we discuss historical and contemporary approaches used to study face recognition, detection, manipulation, and generation. We frame our discussion within the context of how this work has been applied to the study of older adults, but also acknowledge that more work is needed both within this domain as well as at its intersection with, e.g., race and gender. Throughout the chapter we review, and at the end offer links to (Table 11.1), a number of resources that researchers can start using now in their research. We also discuss ongoing concerns related to the ethics of artificial intelligence and to using this emerging technology responsibly.
Analyzing the appearances of political figures in news footage is increasingly important with the growing availability of large-scale news archives and developments in computer vision. We present a deep learning-based method combining face detection, tracking, and classification that is particularly distinctive in requiring no re-training when targeting new individuals: users can feed only a few images of target individuals to reliably detect, track, and classify them. Extensive validation on prominent political figures in two news archives spanning 10 to 20 years, one containing three U.S. cable news channels and the other two major Japanese news programs, consistently shows the high performance and flexibility of the proposed method. The code is made readily available to the public.
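Classifying individuals without re-training, from only a few reference images each, is commonly done by nearest-neighbour matching of face embeddings. The sketch below illustrates that general idea, not this paper's specific pipeline; the toy 3-d vectors stand in for embeddings a face-recognition network would produce:

```python
import numpy as np

def classify_face(embedding, references, threshold=0.6):
    """Match a detected face embedding against a few reference embeddings
    (one representative vector per target individual) using cosine
    similarity. Return the best-matching name, or None if no reference
    exceeds the similarity threshold."""
    best_name, best_sim = None, threshold
    for name, ref in references.items():
        sim = float(np.dot(embedding, ref) /
                    (np.linalg.norm(embedding) * np.linalg.norm(ref)))
        if sim > best_sim:
            best_name, best_sim = name, sim
    return best_name

# Toy 3-d "embeddings"; adding a new individual only requires adding a
# reference vector here, with no model re-training.
refs = {"politician_a": np.array([1.0, 0.0, 0.0]),
        "politician_b": np.array([0.0, 1.0, 0.0])}
match = classify_face(np.array([0.9, 0.1, 0.0]), refs)
```

The threshold lets the classifier reject faces that belong to none of the targets, which matters when scanning archives full of non-target individuals.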
In recent years, deep learning-based robotic grasping methods have surpassed analytical methods in grasping performance. Despite these results, most of these methods use only planar grasps due to the high computational cost of 6D grasps. However, planar grasps have spatial limitations that prevent their applicability in complex environments, such as grasping manufactured objects inside 3D printers. Furthermore, some robotic grasping techniques generate only one feasible grasp per object, whereas multiple candidate grasps per object are necessary because not every generated grasp is kinematically feasible for the robot manipulator or free of collisions with nearby obstacles. Therefore, a new grasping pipeline is proposed to yield 6D grasps and select a specific object in the environment while preventing collisions with nearby obstacles. The grasping trials are performed in an additive manufacturing unit that has a considerable level of complexity due to the high chance of collision. The experimental results prove that it is possible to achieve a considerable success rate in grasping additively manufactured objects. A UR5 robot arm, Intel RealSense D435 camera, and Robotiq 2F-140 gripper are used to validate the proposed method in real experiments.