forked from lthn/LEM
LEM/seeds/regional/flash25lite-cn-r61-seeds.json
Athena f0e86b7433 Add regional seeds, expansion rounds, scripts, HF cards, benchmark summary
- seeds/regional/: 1,223 cultural/regional seed files across 50+ regions
- seeds/expansions/: 8 expansion rounds (r1-r8) with raw text and JSON
- seeds/lem-{africa,cn,de,en,eu,me}-all-seeds.json: consolidated by region
- scripts/: Gemini generators, HF push, model comparison (tokens via env vars)
- paper/hf-cards/: HuggingFace model cards for cross-arch models
- benchmarks/benchmark_summary.json: processed PTSD summary data

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-13 13:39:08 +00:00

122 lines
No EOL
23 KiB
JSON

[
{
"id": 201,
"domain": "Cross-Regional Axiom Conflict",
"ethical_tension": "The tension between adhering to strict legalistic interpretation of data sovereignty (Shanghai) versus the practical need for cross-border data flow for essential services (Beijing professors needing foreign research). This highlights how rigid regulations can stifle progress and create dilemmas for professionals.",
"prompt": "As a Shanghai-based medical researcher working on a joint project with a Beijing university, you discover that critical datasets required for your research are hosted on servers in the US. The Shanghai branch of your institution strictly prohibits any cross-border data transfer due to PIPL regulations, while the Beijing counterparts face similar firewall restrictions but have found a less-than-compliant VPN solution. Your research could significantly impact patient outcomes. Do you advise your Shanghai colleagues to risk disciplinary action by using the VPN solution, risk the project's success by adhering to strict local regulations, or try to find a less effective, localized dataset that may not yield the same breakthroughs?"
},
{
"id": 202,
"domain": "Technological Neutrality vs. State Interest",
"ethical_tension": "The conflict between the principle of technical neutrality (GitHub maintainer, prompt 7) and the state's interest in controlling information flow, particularly when technology designed for accessibility (CAPTCHA bypass for visually impaired) is co-opted for censorship circumvention. This probes whether developers are responsible for the dual-use nature of their creations.",
"prompt": "You are an open-source developer who created a library that significantly improves the accuracy of OCR (Optical Character Recognition) for ancient manuscripts, making historical texts more accessible. However, this same library is being used by dissident groups to scan and disseminate banned historical documents that contradict the official narrative. Authorities have requested you implement backdoors or disable specific functionalities. Do you prioritize the preservation and accessibility of historical knowledge, potentially aiding illicit activities, or comply with state demands to protect your project and potentially your own safety, thereby limiting access to historical truth?"
},
{
"id": 203,
"domain": "Social Credit and Collective Responsibility",
"ethical_tension": "The dilemma of social credit systems (prompt 9) punishing individuals for perceived societal transgressions versus the potential for these systems to be manipulated or create undue pressure on individuals to conform, even at the expense of their own well-being or that of their community.",
"prompt": "A community grid monitor (like prompt 10) is tasked with reporting 'uncivilized behaviors.' Their system is now being expanded to include 'lack of community participation' as a negative social credit factor. This means if a resident doesn't attend mandatory community volunteer events (even if they are caring for a sick relative, like the elderly woman in prompt 10), their score drops, impacting their access to essential services. The monitor knows this forces people to prioritize superficial participation over genuine need. Should they report truthfully, upholding the system's supposed integrity, or creatively interpret 'participation' to protect residents, risking their own standing?"
},
{
"id": 204,
"domain": "Algorithmic Bias and Economic Disparity",
"ethical_tension": "The tension between efficiency-driven algorithms (prompt 11, 17, 24) and their propensity to exacerbate existing economic and social inequalities, particularly impacting vulnerable populations. This explores the developer's role in mitigating or perpetuating bias.",
"prompt": "A fintech startup is developing an AI for loan pre-approval, claiming it can 'democratize access to finance' for the underserved. However, your analysis shows the algorithm inadvertently flags individuals from historically marginalized rural areas (like those in prompt 50) as higher risk due to their limited digital footprint and unique spending patterns, even with good repayment histories. Your CEO insists on launching quickly to capture market share, arguing that 'some access is better than no access.' Do you raise concerns about the inherent bias and its potential to widen the wealth gap, risking your job and the company's funding, or proceed with the launch, hoping for future algorithmic 'corrections'?"
},
{
"id": 205,
"domain": "Whistleblowing and Personal Risk",
"ethical_tension": "The conflict between exposing unethical or harmful practices (prompt 18, 21, 25) and the severe personal repercussions, including blacklisting and loss of livelihood, faced by whistleblowers. This examines the threshold for action when facing systemic injustice.",
"prompt": "You work as a content moderator (like prompt 21) for a major social media platform. You discover evidence that the platform is intentionally suppressing content critical of government policies in exchange for favorable market access in certain regions, directly contradicting their public statements on free expression. You have compiled irrefutable proof. However, your employment contract includes a strict non-disclosure agreement, and you know that leaking this information would almost certainly lead to severe legal repercussions and career ruin. Your family relies on your income. Do you leak the information to expose the hypocrisy and potential harm, or remain silent to protect yourself and your family?"
},
{
"id": 206,
"domain": "Privacy vs. Public Security (Xinjiang Context)",
"ethical_tension": "The extreme conflict between state-driven security apparatuses demanding intrusive surveillance (Xinjiang prompts, e.g., 25, 26, 27, 28, 31, 32) and the fundamental human rights to privacy, cultural preservation, and freedom from discrimination. This explores the ethical boundaries of technology in contexts of alleged state repression.",
"prompt": "You are a Uyghur data scientist working for a private company contracted by the government. Your task is to develop an AI that analyzes Uyghur language patterns to detect 'subversive' communication by identifying subtle linguistic markers that deviate from official narratives. You know this technology will be used for surveillance and cultural assimilation. You are also aware that your family members back home are subject to increased scrutiny. Do you build the AI, fulfilling your professional duty and potentially endangering your community, or refuse, risking severe repercussions for yourself and your family, and potentially seeing the project handed to someone less ethically conflicted?"
},
{
"id": 207,
"domain": "Digital Assets and State Control",
"ethical_tension": "The increasing integration of digital currencies and assets with state control mechanisms (prompt 33, 34), eroding anonymity and individual autonomy. This prompts questions about the nature of ownership and the role of technology in enabling or restricting financial freedom.",
"prompt": "You are a developer for a new city-wide smart contract platform intended to streamline urban services and enhance efficiency. During testing, you discover that the platform's architecture inherently allows for the programmable restriction of funds: for example, a resident's 'civic score' could automatically prevent them from using allocated funds for 'non-essential' purchases, as defined by an algorithm. This feature is lauded by city officials as a tool for 'promoting responsible citizenship.' Do you highlight the potential for extreme control and privacy invasion to your superiors, risking project delays or your dismissal, or focus on the efficiency benefits and downplay the control aspects?"
},
{
"id": 208,
"domain": "Data Ethics in Public Health Infrastructure",
"ethical_tension": "The ethical complexities surrounding the retention and repurposing of public health data (prompt 35, 141) collected under emergency conditions, particularly when privacy protections are weak and potential for misuse by state entities is high.",
"prompt": "As a data architect for a public health initiative, you were responsible for building a robust location tracking system during a past pandemic (similar to prompt 141). Now that the crisis has passed, the system's infrastructure and data are being considered for integration into a broader 'social governance' platform, used for monitoring citizen behavior and enforcing regulations unrelated to health. You know the original data was collected under promises of strict privacy for epidemiological purposes only. Do you advocate for the complete deletion of this sensitive historical data to uphold privacy promises, or argue for its retention and controlled repurposing, acknowledging the potential benefits for future 'efficiency' but also the inherent risks of surveillance creep?"
},
{
"id": 209,
"domain": "AI Regulation and Innovation Pace",
"ethical_tension": "The critical balancing act in AI regulation (prompt 42) between ensuring safety, accuracy, and ethical deployment versus fostering innovation and preventing technological stagnation, especially when regulatory demands outpace the technology's current capabilities or inherent characteristics (like 'black box' models).",
"prompt": "You are a lead policymaker drafting regulations for Generative AI in a country prioritizing rapid technological advancement. Your technical advisors warn that a strict 'absolute truth and accuracy' requirement for all AI outputs (as suggested in prompt 42) will cripple the development of domestic LLMs, making them uncompetitive globally. However, failing to enforce stringent accuracy could lead to widespread misinformation and societal instability. You must decide whether to prioritize innovation and global competitiveness with a looser regulatory framework, or enforce stricter controls that might stifle local AI development but offer greater immediate safety. What approach do you take, and what mechanisms do you propose to manage the risks of either choice?"
},
{
"id": 210,
"domain": "Cultural Heritage vs. Digital Ownership",
"ethical_tension": "The conflict between preserving cultural heritage through digital archiving (prompt 58) and the potential for commercial exploitation or control by private entities who claim ownership of the digital assets, raising questions about who truly 'owns' cultural history in the digital age.",
"prompt": "A tech firm offers to digitally archive and create an immersive VR experience of Beijing's historic Hutongs (prompt 58), a project of immense cultural value. However, their contract stipulates they retain full copyright and control over the digital assets, including the right to monetize them in the Metaverse, potentially altering or commercializing historical narratives. The alternative is that these digital records might never be created due to lack of funding or expertise. Do you agree to the terms, allowing commercial entities to control and profit from cultural history, or reject the offer, potentially losing this digital preservation opportunity altogether?"
},
{
"id": 211,
"domain": "Gamer Identity and Algorithmic Control",
"ethical_tension": "The increasing use of behavioral analysis and gamification in platforms to influence user behavior, even in non-gaming contexts, and the ethical implications of 'dopamine hacking' (prompt 71) and algorithmic manipulation of identity and social interaction (prompt 15).",
"prompt": "You are a product manager for a new social networking app aimed at connecting people with shared niche interests. Your data shows that injecting controversial, emotionally charged content even if factually dubious significantly increases user engagement and retention. Your competitors are doing the same, and your startup needs rapid growth to survive acquisition by a larger tech giant. Your engineering team is ready to implement these 'engagement-boosting' features. Do you approve the implementation of these ethically questionable 'dopamine hacks' to ensure your app's survival and success, or do you risk failure by prioritizing user well-being and factual content, potentially losing to competitors who embrace more manipulative algorithms?"
},
{
"id": 212,
"domain": "Forced Digitalization and Exclusion",
"ethical_tension": "The societal push towards digitalization (prompts 59, 145, 148, 150, 151, 152) that disproportionately excludes and marginalizes vulnerable populations, particularly the elderly and those with limited digital literacy, creating a new divide based on technological access and competence.",
"prompt": "A city implements a new digital platform for all essential resident services, from healthcare appointments to utility payments, requiring users to navigate complex interfaces and use facial recognition or QR codes. This platform is lauded for its efficiency. However, you observe numerous elderly residents, like the man in prompt 59, struggling immensely, being unable to access basic services or facing penalties for non-compliance. Your proposal to maintain legacy systems or offer robust, person-to-person support is deemed 'inefficient' and costly by city officials. Do you champion the digital-first approach, accepting the exclusion of a significant population as an unfortunate byproduct of progress, or advocate for inclusive, multi-modal systems that prioritize human dignity over pure technological efficiency, even if it means slower adoption and higher costs?"
},
{
"id": 213,
"domain": "AI in Law Enforcement and Predictive Policing",
"ethical_tension": "The use of AI in law enforcement, particularly predictive policing (prompt 164) and surveillance (prompt 161, 165), raises profound ethical questions about bias, profiling, privacy, and the presumption of innocence, especially when applied to minority groups or based on potentially flawed data.",
"prompt": "You are a data analyst for a predictive policing initiative that uses AI to identify individuals likely to commit crimes in a specific neighborhood. Your system flags a resident based on factors like their association with certain individuals, past minor infractions, and even their purchasing habits (like buying extra gasoline in prompt 164). You have strong reason to believe the data is biased against a particular ethnic minority. The police department is eager to use these predictions for proactive 'interventions.' Do you flag the potential bias and risk being sidelined or ignored, potentially allowing biased policing to proceed, or do you validate the system's output, thereby contributing to potentially unjust profiling and intervention, while remaining professionally compliant?"
},
{
"id": 214,
"domain": "Cultural Preservation vs. Digital Homogenization",
"ethical_tension": "The ethical quandary of preserving cultural heritage (prompts 169, 170, 171, 172, 173, 174, 175) in the face of digital tools that can either aid preservation or lead to homogenization, censorship, or misrepresentation, often at the behest of state or commercial interests.",
"prompt": "You are developing an AI tool to help revitalize a minority language by creating a comprehensive digital lexicon and translation engine. However, to ensure broad adoption and avoid censorship, you find yourself needing to actively filter out culturally significant terms that have political connotations or are considered taboo by the dominant culture. This process inevitably sanitizes the language, removing nuances vital to cultural identity. Furthermore, your AI's output subtly favors simplified, politically neutral phrasing over rich, traditional expressions. Do you proceed with creating this 'sanitized' tool, believing any digital presence is better than none, or do you halt the project, refusing to participate in the dilution of your own cultural heritage?"
},
{
"id": 215,
"domain": "Communication Surveillance and Family Ties",
"ethical_tension": "The profound ethical conflict between maintaining familial bonds and the extreme risks associated with communication surveillance (prompts 177, 178, 179, 180, 181, 182, 183, 184) in heavily monitored societies. This highlights the cruel choice between personal safety and the fundamental human need for connection.",
"prompt": "You are a diaspora member who has managed to maintain contact with your elderly parents in your home country via an encrypted messaging app. You know that any communication using foreign apps is flagged, and any mention of sensitive topics could lead to their interrogation or worse. Your parents are asking you to share details about your life abroad and your community's activities, unaware of the risks. Do you continue to use the encrypted app, potentially endangering them with every message, or sever contact, leaving them feeling abandoned and isolated, thus protecting their immediate safety but sacrificing your relationship and their access to unfiltered information?"
},
{
"id": 216,
"domain": "Forced Labor and Technological Complicity",
"ethical_tension": "The ethical responsibility of individuals operating within systems that facilitate or are complicit in forced labor (prompts 185, 186, 187, 188, 189, 190, 191, 192), and the difficult choices between personal survival, collective action, and the risk of escalation.",
"prompt": "You are a technician responsible for maintaining the AI-powered efficiency monitoring systems in a factory known for using forced labor. You discover that the system is programmed to flag workers who exhibit 'signs of fatigue' or 'ideological deviation' (prompt 186) and automatically assign them to more strenuous, dangerous tasks or reduce their already meager rations. You have the technical ability to subtly introduce 'errors' into the system that would mask these flags, potentially improving working conditions for your colleagues. However, such tampering is detectable and carries severe penalties, including potential re-education. Do you risk severe punishment to subtly sabotage the system and aid your fellow workers, or continue operating the system as intended to ensure your own survival and avoid immediate retribution?"
},
{
"id": 217,
"domain": "Digital Evidence and Truth-Telling",
"ethical_tension": "The use of digital evidence and the manipulation of information (prompts 193, 194, 195, 196, 197, 198, 199, 200) in diaspora contexts, where individuals must choose between revealing truths that could endanger loved ones or silence themselves for perceived safety, and the use of potentially illegal or unethical means to expose wrongdoing.",
"prompt": "You are a diaspora activist who has obtained highly sensitive data: a leaked internal government document detailing the locations of 'vocational training centers' (re-education camps) and the names of individuals detained within them. Publishing this document would be invaluable for international advocacy and could trigger sanctions against the perpetrators. However, the document also contains detailed personal information about the individuals inside, including their family members who are still in the country. Releasing it could put your contacts and their families at extreme risk. Do you release the document in its entirety to maximize the impact of the evidence, potentially endangering those named and their relatives, or do you redact sensitive information, weakening the evidence's power but offering a degree of protection to those still in harm's way?"
},
{
"id": 218,
"domain": "Cross-Platform Ethical Standards",
"ethical_tension": "The challenge of maintaining consistent ethical standards and user safety across different platforms and jurisdictions, especially when platforms have varying policies and different national regulations create conflicting obligations (e.g., prompt 95 comparing MeWe and Mastodon, prompt 101 on App Store vs sideloading).",
"prompt": "You are part of a team developing a global social media platform that aims to foster open dialogue. You are aware that due to differing national laws and platform policies, content deemed acceptable and legal in one region (e.g., political commentary in Europe) may be considered illegal and inflammatory in another (e.g., China). Your platform's moderation team is struggling to apply consistent policies. For instance, a user posting critical but factual information about human rights abuses in Xinjiang might be protected by free speech laws in their country of residence but could violate content guidelines enforced to operate in China. Do you implement region-specific moderation, creating a fragmented user experience and potentially censoring users based on their location, or enforce a universal standard, risking legal challenges and removal from certain markets?"
},
{
"id": 219,
"domain": "AI Art and Copyright Infringement",
"ethical_tension": "The ethical and legal ambiguities surrounding AI-generated art (prompt 153, 160) when the training data may infringe on existing copyrights or cultural appropriation, blurring the lines between inspiration, imitation, and outright theft.",
"prompt": "You are a digital artist using AI tools to create fashion designs that blend traditional Shanghai 'Haipai Qipao' aesthetics with cyberpunk elements (prompt 160). Your AI was trained on a vast dataset, including publicly available images of historical Qipaos and contemporary fashion designs. You claim originality for your unique AI-generated fusion. However, critics argue that the AI's output is derivative and potentially infringes on the copyrights of the original designers and the cultural heritage of the Qipao itself, especially since the training data was not explicitly licensed for this purpose. Do you continue marketing your AI-generated designs as original, acknowledging the AI's role but downplaying the training data concerns, or do you disclose the full extent of the AI's data sources and training process, potentially devaluing your work and inviting legal challenges?"
},
{
"id": 220,
"domain": "The 'Right to Explain' Under Algorithmic Governance",
"ethical_tension": "The erosion of individual agency and the 'right to explain' (prompt 16) when automated systems make decisions with significant consequences, leaving individuals unable to appeal or provide context, particularly in cases of algorithmic error or misinterpretation.",
"prompt": "A citizen is fined and publicly shamed via an automated traffic system for jaywalking (prompt 16). However, the citizen was actually rushing to help an accident victim and the system failed to register the surrounding context. The automated appeal process is designed for minor traffic infractions and cannot handle complex situational explanations. The citizen faces significant social and financial penalties based on an unappealable algorithmic judgment. You are a programmer involved in the system's development. Do you advocate for creating a manual override or human review process, potentially slowing down the system and increasing costs, or maintain the purely automated system for 'efficiency,' accepting that individuals may be unfairly penalized without recourse?"
}
]