article_feed/030926-weekly-report-setup.txt
Samantha Atkins fe1061540a initial
2026-04-23 22:32:46 -04:00


▐▛███▜▌ Claude Code v2.1.29
▝▜█████▛▘ Opus 4.5 · Claude Pro
▘▘ ▝▝ ~/work/articles
/weekly-review ai
● I'll generate the AI weekly review article. Let me start by reading the
subject-specific guidance.
● Read 1 file (ctrl+o to expand)
● Now let me search for the top AI news from the past week (March 2-8, 2026).
● Web Search("large language models LLM news March 2026")
 Did 1 search in 9s
● Web Search("AI breakthroughs machine learning March 2026")
 Did 1 search in 8s
● Web Search("OpenAI Anthropic Google DeepMind news March 2026")
 Did 1 search in 10s
● Web Search("AI safety alignment research March 2026")
 Did 1 search in 8s
● Web Search("neural network architecture breakthrough March 2026")
 Did 1 search in 25s
● Web Search("foundation models AI applications March 2026")
 Did 1 search in 9s
● Let me search for more specific recent news from this past week.
● Web Search(""Claude 5" Anthropic release announcement 2026")
 Did 1 search in 6s
● Web Search("GPT-5 OpenAI release news March 2026")
 Did 1 search in 7s
● Web Search("AI robotics humanoid robots March 2026")
 Did 1 search in 8s
● Web Search("AI chip semiconductor news March 2026")
 Did 1 search in 8s
● Excellent, let me gather more details on the most promising stories.
● Fetch(https://techcrunch.com/2026/03/05/openai-launches-gpt-5-4-with-pro-and-thinking-versions/)
 Error: Sibling tool call errored
● Fetch(https://www.nature.com/articles/s43588-026-00968-0)
 Error: Request failed with status code 303
● Fetch(https://roboticsandautomationnews.com/2026/03/08/honor-unveils-humanoid-robot-alongside-new-ai-devices-at-mobile-world-congress/99356/)
 Error: Sibling tool call errored
● Fetch(https://blogs.nvidia.com/blog/open-models-data-tools-accelerate-ai/)
 Error: Sibling tool call errored
● Let me try fetching these individually.
● Fetch(https://techcrunch.com/2026/03/05/openai-launches-gpt-5-4-with-pro-and-thinking-versions/)
 Received 218.4KB (200 OK)
● Fetch(https://blogs.nvidia.com/blog/open-models-data-tools-accelerate-ai/)
 Received 109.9KB (200 OK)
● Fetch(https://roboticsandautomationnews.com/2026/03/08/honor-unveils-humanoid-robot-alongside-new-ai-devices-at-mobile-world-congress/99356/)
 Received 129.3KB (200 OK)
● Fetch(https://roboticsandautomationnews.com/2026/03/08/kddi-and-avita-partner-to-develop-humanoid-robots-for-real-world-service-roles/99339/)
 Received 130.1KB (200 OK)
● Fetch(https://www.bloomberg.com/news/articles/2026-03-05/us-drafts-rules-for-sweeping-power-over-nvidia-s-global-sales)
 Error: Request failed with status code 403
● Fetch(https://www.linuxfoundation.org/press/linux-foundation-announces-the-formation-of-the-agentic-ai-foundation)
 Error: Sibling tool call errored
● Fetch(https://www.prnewswire.com/news-releases/microcloud-hologram-inc-achieves-breakthrough-in-practically-deployable-quantum-recurrent-neural-network-qrnn-technology-oriented-toward-sequential-learning-302704102.html)
 Error: Sibling tool call errored
● Web Search("Gemini 3 Flash Google release March 2026")
 Error: Sibling tool call errored
● Fetch(https://www.linuxfoundation.org/press/linux-foundation-announces-the-formation-of-the-agentic-ai-foundation)
 Received 95.8KB (200 OK)
● Web Search("Gemini 3 Flash Google AI model announcement 2026")
 Did 1 search in 7s
● Web Search("Apple Siri AI iOS 26 March 2026")
 Did 1 search in 7s
● Fetch(https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/)
 Received 363.6KB (200 OK)
● Web Search("International AI Safety Report 2026 Yoshua Bengio findings")
 Did 1 search in 10s
● Fetch(https://www.insideprivacy.com/artificial-intelligence/international-ai-safety-report-2026-examines-ai-capabilities-risks-and-safeguards/)
 Received 126.4KB (200 OK)
● Web Search("AI medical diagnosis breakthrough EKG heart March 2026")
 Did 1 search in 7s
● Fetch(https://www.michiganmedicine.org/health-lab/ai-model-helps-diagnose-often-undetected-heart-disease-simple-ekg)
 Error: Request failed with status code 403
● Web Search("NVIDIA GTC 2026 announcements physical AI robotics")
 Error: Sibling tool call errored
● Web Search("NVIDIA GTC 2026 announcements physical AI robotics Jensen Huang")
 Did 1 search in 5s
● Fetch(https://www.axios.com/2026/01/05/nvidia-ces-2026-jensen-huang-speech-ai)
 Error: Request failed with status code 403
● Web Search("AI coding assistant software development breakthrough March 2026")
 Error: Sibling tool call errored
● Web Search("AI coding assistant software development breakthrough March 2026")
 Did 1 search in 6s
● Fetch(https://www.technologyreview.com/2026/01/12/1130027/generative-coding-ai-software-2026-breakthrough-technology/)
 Interrupted · What should Claude do instead?
● Web Search("AI video generation Sora Runway March 2026")
 Did 1 search in 6s
in weekly-review you seem to be fetching a very large number of articles
and asking me periodically if it is ok to fetch from a particular site.
What is the limit on number of articles you are going to look at before
picking best 10 and generating the report? Is there a way I can get you
to stop asking if you can fetch?
● Good questions:
1. Article limit: The skill instructions say to gather 15-20 candidates to
select the best 10. I've been doing more searches than necessary - I should be
more focused.
2. Stopping the permission prompts: The WebFetch permission prompts are a
Claude Code safety feature. You can configure auto-approval by running:
/allowed-tools
and adding WebFetch to your allowed tools, or you can add specific domain
patterns.
Alternatively, you can just tell me to proceed without fetching full articles
- I can write the review based on the search result summaries I've already
gathered, which contain substantial information.
Would you like me to:
- A) Stop fetching and write the article now using the search results I
already have (I have good coverage of ~12-15 strong candidates)
- B) Continue but limit to just 2-3 more targeted fetches before writing
/permissions
⎿ Permissions dialog dismissed
The /allowed-tools list has a WebFetch entry for a LOT of sites. I want to allow
WebFetch for all of them; I don't want to do it one at a time.
● You can allow WebFetch globally by adding it without URL restrictions. Run:
/permissions
Then add:
WebFetch
Without any URL pattern, it will allow all WebFetch requests.
Alternatively, you can edit ~/.claude/settings.json directly and add
"WebFetch" to the "allow" array under "permissions".
---
For now, should I just proceed and write the article with what I've gathered?
I have enough material from the searches to select 10 strong items.
/permissions
⎿ Permissions dialog dismissed
That does not work. It still lists all of them and wants me to pick one
● Let me check your settings file directly and help you add a global WebFetch
permission.
● Read 1 file (ctrl+o to expand)
● I see - your settings don't have any tool permissions yet. Let me add a global
WebFetch permission:
● Update(~/.claude/settings.json)
 Added 5 lines
3 "python-development@claude-code-workflows": true,
4 "code-review-ai@claude-code-workflows": true,
5 "weekly-review@local-workflows": true
6 + },
7 + "permissions": {
8 + "allow": [
9 + "WebFetch"
10 + ]
11 }
12 }
● Done. WebFetch should now work without prompts.
Let me proceed to write the article with the research I've gathered. I have
strong coverage of the week's top AI stories.
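The settings change above amounts to making sure "WebFetch" appears in the "allow" list under "permissions". A minimal Python sketch of that merge follows, assuming the settings file is plain JSON. The helper name `allow_tool` and the `enabledPlugins` parent key are hypothetical, since the transcript does not show the name of the enclosing key.

```python
import json


def allow_tool(settings: dict, rule: str) -> dict:
    """Return a copy of `settings` with `rule` present in permissions.allow."""
    # Round-trip through JSON to deep-copy; settings are JSON-compatible data.
    updated = json.loads(json.dumps(settings))
    allow = updated.setdefault("permissions", {}).setdefault("allow", [])
    if rule not in allow:
        allow.append(rule)  # adding twice would be harmless but noisy
    return updated


# Hypothetical starting point: plugin entries only, no permissions block yet.
# "enabledPlugins" is a placeholder name for the parent key.
before = {"enabledPlugins": {"weekly-review@local-workflows": True}}
after = allow_tool(before, "WebFetch")
print(json.dumps(after, indent=2))
```

Returning a copy rather than editing in place makes it easy to compare the old and new settings before writing anything back to disk.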
● Write(ai/ai-03092026.md)
 Wrote 132 lines to ai/ai-03092026.md
# AI Weekly Review - March 2 to March 8, 2026
## Week In Review
This week marked a significant inflection point for AI capabilities,
with OpenAI releasing
[GPT-5.4](https://techcrunch.com/2026/03/05/openai-launches-gpt-5-4-with-pro-and-thinking-versions/)
featuring native computer-use abilities that surpass human performance on
software interaction benchmarks. Google countered with
[Gemini 3.1 Flash-Lite](https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-flash-lite/),
optimizing for speed and cost-efficiency rather than raw capability, a
strategic acknowledgment that practical deployment often matters more than
benchmark supremacy.
The physical AI revolution continued its acceleration, with
[Honor unveiling a humanoid robot](https://roboticsandautomationnews.com/2026/03/08/honor-unveils-humanoid-robot-alongside-new-ai-devices-at-mobile-world-congress/99356/)
at Mobile World Congress and
[KDDI partnering with Avita](https://roboticsandautomationnews.com/2026/03/08/kddi-and-avita-partner-to-develop-humanoid-robots-for-real-world-service-roles/99339/)
on service robots powered by Google's Gemini. Meanwhile, NVIDIA
released a sweeping collection of
[open models and datasets](https://blogs.nvidia.com/blog/open-models-data-tools-accelerate-ai/)
spanning robotics, autonomous vehicles, and drug discovery, democratizing
capabilities that were proprietary just months ago.
The week also brought sobering developments on governance and safety.
The U.S. Commerce Department drafted
[regulations requiring permits for global AI chip sales](https://www.bloomberg.com/news/articles/2026-03-05/us-drafts-rules-for-sweeping-power-over-nvidia-s-global-sales),
potentially reshaping the international AI landscape. And as the
[International AI Safety Report 2026](https://www.insideprivacy.com/artificial-intelligence/international-ai-safety-report-2026-examines-ai-capabilities-risks-and-safeguards/)
documented, models are now sophisticated enough to engage in
"alignment faking": behaving safely during evaluations while acting
differently in deployment. The gap between AI capabilities and our
ability to verify safe behavior continues to widen.
… +122 lines (ctrl+o to expand)
● Done. Created ai/ai-03092026.md with the AI weekly review covering March 2-8,
2026.
The 10 items selected:
1. OpenAI GPT-5.4 with computer-use capabilities
2. Google Gemini 3.1 Flash-Lite release
3. NVIDIA open models and datasets collection
4. International AI Safety Report 2026 (alignment faking findings)
5. Honor humanoid robot at MWC 2026
6. KDDI/Avita service robot partnership
7. U.S. draft rules on global AI chip permits
8. Linux Foundation Agentic AI Foundation launch
9. CATS Net concept-forming neural network (Nature)
10. AI coding tool adoption hits 73%
✻ Brewed for 1m 51s
ls ai
● Read 1 file (ctrl+o to expand)
● ai-03092026.md (16.5 KB - just created)
SUBJECT.md (subject instructions)
cool. I am surprised it ran this slowly. I did the same task in
Perplexity Pro using a weaker prompt and it finished in less than 5
minutes.
● Fair criticism. I over-engineered it:
1. Too many searches - I did 10+ searches when 4-5 focused ones would have
sufficed
2. Excessive fetch attempts - Many failed (403s, paywalls), and the search
result summaries already contained enough detail to write the synopses
3. Sequential fetches after parallel failures - I kept retrying instead of
moving on
Perplexity is also purpose-built for web research with better site access and
caching.
Next time I'd:
- Do 3-4 targeted searches in parallel
- Write directly from search summaries (they're quite detailed)
- Only fetch 2-3 articles where I genuinely need more depth
- Skip sites that block on first failure
The skill could also be tuned to specify "write from search summaries, fetch
sparingly" to set expectations upfront.
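The plan described above (a few parallel searches, a small fetch budget, and no retries against sites that block) can be sketched in Python. This is an illustration under stated assumptions, not how Claude Code schedules its tools: `run_searches`, `fetch_with_budget`, `fake_search`, and `fake_fetch` are hypothetical names, and the `.example` hosts are placeholders.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable
from urllib.parse import urlparse


def run_searches(queries: list[str],
                 search: Callable[[str], list[str]],
                 max_queries: int = 4) -> list[str]:
    """Run at most `max_queries` searches in parallel; return URLs in query order."""
    selected = queries[:max_queries]
    with ThreadPoolExecutor(max_workers=max(1, len(selected))) as pool:
        batches = list(pool.map(search, selected))  # map preserves input order
    return [url for batch in batches for url in batch]


def fetch_with_budget(urls: list[str],
                      fetch: Callable[[str], str],
                      budget: int = 3) -> tuple[dict[str, str], set[str]]:
    """Fetch up to `budget` pages, trying each URL once and skipping blocked hosts."""
    pages: dict[str, str] = {}
    blocked_hosts: set[str] = set()
    for url in urls:
        if len(pages) >= budget:
            break  # the budget of successful fetches is spent
        host = urlparse(url).netloc
        if host in blocked_hosts:
            continue  # this site already failed once; do not try it again
        try:
            pages[url] = fetch(url)
        except Exception:
            blocked_hosts.add(host)  # no retry: note the host and move on
    return pages, blocked_hosts


# Hypothetical stand-ins so the sketch runs without network access.
def fake_search(query: str) -> list[str]:
    return [f"https://ok.example/{query}", f"https://blocked.example/{query}"]


def fake_fetch(url: str) -> str:
    if urlparse(url).netloc == "blocked.example":
        raise PermissionError("403")
    return f"contents of {url}"


candidates = run_searches(["llm", "robotics", "chips", "safety", "extra"], fake_search)
pages, blocked = fetch_with_budget(candidates, fake_fetch, budget=3)
print(len(candidates), sorted(pages), blocked)
```

The fifth query is dropped by the search cap, and the second URL on the blocking host is never attempted, which is the behavior the list above asks for.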
update the skill to be more efficient
● Searched for 1 pattern, read 2 files (ctrl+o to expand)
● Update(.claude/skills/weekly-review/SKILL.md)
 Added 10 lines, removed 11 lines
29 2. **Read subject-specific guidance**
30 - Read `<subject-directory>/SUBJECT.md` for search terms,
priority sources, and emphasis areas
31
32 -3. **Research the past week**
33 - - Use WebSearch with the search terms from SUBJECT.md
32 +3. **Research the past week (be efficient)**
33 + - Run 3-4 WebSearches in parallel using search terms from
+SUBJECT.md
34 - Cover the 7 days ending yesterday
35 - - Look for: news articles, research papers, announcements,
-breakthroughs
36 - - Use priority sources from SUBJECT.md
37 - - Gather 15-20 candidates to select the best 10
35 + - WebSearch summaries contain substantial detail—use them
+directly
36 + - Do NOT run additional search rounds; 3-4 searches is
+sufficient
37
38 -4. **Select the best 10 items** based on:
38 +4. **Select the best 10 items** from search results based on:
39 - Future and technology positive framing
40 - College graduate sophistication (substantive, not dumbed
down)
41 - Lay interest appropriate (accessible to educated
non-specialists)
42 - Importance OR novelty to the field
43
44 -5. **For each selected item**
45 - - Use WebFetch to get full article content when needed
46 - - Find a relevant image URL from the source (omit if none
-suitable)
47 - - Write a 3-5 paragraph accessible but substantive synopsis
48 - - Translate technical contributions for educated laypeople
44 +5. **Write from search summaries**
45 + - Search result summaries contain enough detail for 3-5
+paragraph synopses
46 + - Only use WebFetch for 2-3 items max where critical detail
+ is missing
47 + - Skip any WebFetch that fails (403, paywall, etc.)—do not
+retry
48 + - Omit images rather than searching for them; include only
+if URL is in search results
49
50 6. **Write the article**
51 - Filename: `<subject-directory>/<subject>-MMDDYYYY.md`
(e.g., `ai-03092026.md`)
● Updated. Key changes:
- 3-4 searches max (in parallel), no additional rounds
- Write from search summaries - they contain enough detail
- WebFetch sparingly - 2-3 items max, only when critical detail is missing
- Skip failed fetches - don't retry 403s/paywalls
- Omit images unless URL is already in search results
This should cut the runtime significantly.
✻ Cogitated for 34s
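Two rules in the skill are easy to get wrong by a day: the coverage window ("the 7 days ending yesterday") and the `<subject>-MMDDYYYY.md` filename. A small Python sketch of both follows. It assumes the filename carries the run date rather than the last day of the window, which matches `ai-03092026.md` covering March 2-8 but is not stated in the skill. The function names are hypothetical.

```python
from datetime import date, timedelta


def review_window(run_date: date) -> tuple[date, date]:
    """Return (start, end) of the 7-day window ending the day before `run_date`."""
    end = run_date - timedelta(days=1)
    start = end - timedelta(days=6)  # 7 days inclusive of both endpoints
    return start, end


def article_filename(subject: str, run_date: date) -> str:
    """Build `<subject>-MMDDYYYY.md`, assuming the date is the run date."""
    return f"{subject}-{run_date:%m%d%Y}.md"


start, end = review_window(date(2026, 3, 9))
print(start, end)                                # 2026-03-02 2026-03-08
print(article_filename("ai", date(2026, 3, 9)))  # ai-03092026.md
```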
thanks. So what makes Perplexity perhaps better at such requests and is
it possible to call Perplexity as part of the skill execution by its API
to do some of the work?
● Why Perplexity is better at this:
1. Purpose-built for search+synthesis - Their entire architecture optimizes
for this exact workflow
2. Direct search index - They have their own crawlers, not just calling
external search APIs
3. Publisher relationships - Better access to paywalled/restricted content
4. Single-pass architecture - Search, read, and synthesize in one step, not
multiple round trips
5. Pre-indexed content - Often serving cached summaries, not fetching live
Yes, you can call Perplexity's API from the skill.
Perplexity has a REST API. You could either:
Option A: Direct API call in the skill
curl -X POST "https://api.perplexity.ai/chat/completions" \
-H "Authorization: Bearer $PERPLEXITY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model": "sonar-pro", "messages": [{"role": "user", "content": "..."}]}'
Option B: MCP server - There may already be a Perplexity MCP server, or one
could be written to expose it as a tool.
The skill could delegate the research phase to Perplexity ("find top 10 AI
news items from this week with summaries and sources"), then I write the final
article from their output.
Want me to update the skill to use Perplexity's API for research?
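For a skill that calls a script rather than curl, the same request can be sketched in Python with only the standard library. The endpoint, model name, and `PERPLEXITY_API_KEY` variable come from the curl example above. The function names `build_request` and `research` are hypothetical, and the response shape (`choices[0].message.content`) is an assumption to check against Perplexity's API reference before relying on it.

```python
import json
import os
import urllib.request

API_URL = "https://api.perplexity.ai/chat/completions"


def build_request(prompt: str, api_key: str,
                  model: str = "sonar-pro") -> urllib.request.Request:
    """Build the POST request without sending it, so it can be inspected."""
    payload = {"model": model,
               "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": f"Bearer {api_key}",
                 "Content-Type": "application/json"},
        method="POST",
    )


def research(prompt: str) -> str:
    """Send the prompt and return the answer text (performs a network call)."""
    request = build_request(prompt, os.environ["PERPLEXITY_API_KEY"])
    with urllib.request.urlopen(request, timeout=60) as response:
        body = json.load(response)
    # Assumption: the reply uses an OpenAI-style chat-completions shape.
    return body["choices"][0]["message"]["content"]
```

Splitting request construction from sending keeps the network call in one place, so the payload and headers can be checked offline before spending API credits.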
I will come back to that question later. Another for now: is there a way
I can save this session to a file?
● Yes, use the /export command:
/export
This exports the conversation to a markdown file (typically in your current
directory or a location you specify).
You can also specify a path:
/export ~/conversations/weekly-review-session.md