LAUSR: human feedback

Helpful, harmless, honest? Sociotechnical limits of AI alignment and safety through Reinforcement Learning from Human Feedback

Sign Up to like & get
recommendations!
0 Published in 2025 at "Ethics and Information Technology"

DOI: 10.1007/s10676-025-09837-2

Abstract: This paper critically evaluates the attempts to align Artificial Intelligence (AI) systems, especially Large Language Models (LLMs), with human values and intentions through Reinforcement Learning from Feedback methods, involving either human feedback (RLHF) or AI… read more here.

Keywords: helpful harmless; feedback; human feedback; reinforcement learning ... See more keywords

How human–AI feedback loops alter human perceptual, emotional and social judgements

Sign Up to like & get
recommendations!
0 Published in 2024 at "Nature Human Behaviour"

DOI: 10.1038/s41562-024-02077-2

Abstract: Artificial intelligence (AI) technologies are rapidly advancing, enhancing human capabilities across various fields spanning from finance to medicine. Despite their numerous advantages, AI systems can exhibit biased judgements in domains ranging from perception to emotion.… read more here.

Keywords: social judgements; perceptual emotional; feedback; human perceptual ... See more keywords

Shade Artifact Reduction in CBCT-to-MDCT: Fine-Tuning Based on Style Transfer and Human Feedback

Sign Up to like & get
recommendations!
0 Published in 2025 at "IEEE Access"

DOI: 10.1109/access.2025.3552063

Abstract: Cone beam computed tomography (CBCT) is widely used in dental treatment due to its low radiation dose and cost. However, it has lower image quality compared to Multi Detector Computed Tomography (MDCT), limiting its use… read more here.

Keywords: cbct; style transfer; human feedback; mdct ... See more keywords

LAUSR

You are not signed in:

Sign Up!

Helpful, harmless, honest? Sociotechnical limits of AI alignment and safety through Reinforcement Learning from Human Feedback

How human–AI feedback loops alter human perceptual, emotional and social judgements

Shade Artifact Reduction in CBCT-to-MDCT: Fine-Tuning Based on Style Transfer and Human Feedback