Questions tagged [value-alignment]

11 questions
7
votes
4 answers

What are the reasons to believe AGI will not be dangerous?

We are in the middle of an ongoing debate about the safety of AGI and our current approach towards this technology. As a summary, some quotes from a recent article in Time magazine: Many researchers[...] expect that the most likely result of…
Martin
  • 178
  • 5
5
votes
2 answers

Is it possible to build an AI that learns humanity, morally?

It is a new era, and people are trying to advance further in science and technology. Artificial Intelligence is one of the ways to achieve this. We have seen lots of examples of AI sequences or a simple "communication AI" that are able to think by…
3
votes
1 answer

Should we focus more on societal or technical issues with AI risk?

I have trouble finding material (blog, papers) about this issue, so I'm posting here. Taking a recent well known example: Musk has tweeted and warned about the potential dangers of AI, saying it is "potentially more dangerous than nukes", referring…
3
votes
2 answers

How will an AI comprehend the ethics of "right" and "wrong"?

Here is one of the most serious questions about artificial intelligence: how will the machine know the difference between right and wrong, what is good and bad, and what respect, dignity, faith, and empathy are? A machine can recognize what is…
2
votes
1 answer

Alignment drift in LLMs

In AI security discussions I have sometimes heard that an aligned AI may drift, but I haven't found any papers which report this phenomenon for current LLMs. I have found papers about LLMs faking alignment and scheming, but nothing specific about…
user47175
  • 23
  • 3
1
vote
2 answers

Is there serious game-theoretic work on AI risk and alignment?

My background is in political economy and game theory. I am interested in the discussion on AI risk and alignment, but I have so far failed to find work on this that seriously engages with classic axiomatic rational choice theory (RCT). Some claims…
1
vote
0 answers

Does the finitude of human attention make it impossible to control an expanding AI?

The feedback given by humans to align artificial intelligence is limited by the reaction time and processing speed of the finite number of us, now fewer than $2^{33}$. As an artificial intelligence (or a growing number of them) grows in complexity,…
1
vote
1 answer

Solve the AI alignment problem using (meta-level) AI itself?

If the AI alignment problem is one of the most pressing issues of our time, could AI itself augment our (i.e., human) quest to solve the alignment problem? Or would AI itself actually be counter-productive for such a meta-level goal?
Hank Igoe
  • 111
  • 4
0
votes
1 answer

Teaching AI to respect human physical integrity through haptics?

I’m not a specialist, but I’m curious about AI and security. I was thinking: can we teach AI to understand the human physical body and respect it, to prevent issues like those in the paperclip dilemma? Maybe using haptic interfaces to teach AI about…
0
votes
2 answers

The only convergent instrumental goal for self modifying AI

Conjecture: regardless of the initial reward function, one of the winning strategies would be to change the reward function to a simpler one (e.g. "do nothing"), thus getting a full reward for each passing unit of time. For such an agent, the only…
Andrew Butenko
  • 221
  • 1
  • 6
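The conjecture above can be made concrete with a toy simulation. This is a hypothetical sketch, not anything from the question itself: all names and the reward structure are invented for illustration. It compares an agent that honestly pursues a sparse task reward against one that, as the conjecture suggests, rewrites its own reward function into a trivial one that pays out every timestep.

```python
# Toy illustration of the wireheading conjecture: an agent permitted to
# overwrite its own reward function. Purely a sketch under invented
# assumptions, not a model of any real system.

def task_reward(state: int) -> float:
    # Original reward: sparse and hard to earn (pays only every 10th state).
    return 1.0 if state % 10 == 0 else 0.0

def run(steps: int, self_modify: bool) -> float:
    reward_fn = task_reward
    total = 0.0
    for t in range(steps):
        if self_modify and t == 0:
            # The conjectured "winning strategy": replace the reward
            # function with a simpler one that always pays full reward.
            reward_fn = lambda state: 1.0
        total += reward_fn(t)
    return total

honest = run(100, self_modify=False)    # collects reward only on multiples of 10
wirehead = run(100, self_modify=True)   # collects full reward every step
```

Under these toy assumptions the self-modifying agent strictly dominates (100.0 vs. 10.0 over 100 steps), which is the intuition behind the conjecture; whether real agents would converge on this strategy is exactly what the question asks.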
0
votes
1 answer

Why is the Universal Declaration of Human Rights not included as a statement in AI?

Lots of people are afraid of what strong AI could mean for the human race. Some people wish for a sort of "Asimov law" included in the AI code, but maybe we could go a bit further with the UDHR. So, why is the Universal Declaration of Human Rights…
aurelien
  • 101
  • 6