431 lines
No EOL
18 KiB
Text
431 lines
No EOL
18 KiB
Text
|
||
▐▛███▜▌ Claude Code v2.1.29
|
||
▝▜█████▛▘ Opus 4.5 · Claude Pro
|
||
▘▘ ▝▝ ~/work/articles
|
||
|
||
❯ /weekly-review ai
|
||
|
||
● I'll generate the AI weekly review article. Let me start by reading the
|
||
subject-specific guidance.
|
||
|
||
● Read 1 file (ctrl+o to expand)
|
||
|
||
● Now let me search for the top AI news from the past week (March 2-8, 2026).
|
||
|
||
● Web Search("large language models LLM news March 2026")
|
||
⎿ Did 1 search in 9s
|
||
|
||
● Web Search("AI breakthroughs machine learning March 2026")
|
||
⎿ Did 1 search in 8s
|
||
|
||
● Web Search("OpenAI Anthropic Google DeepMind news March 2026")
|
||
⎿ Did 1 search in 10s
|
||
|
||
● Web Search("AI safety alignment research March 2026")
|
||
⎿ Did 1 search in 8s
|
||
|
||
● Web Search("neural network architecture breakthrough March 2026")
|
||
⎿ Did 1 search in 25s
|
||
|
||
● Web Search("foundation models AI applications March 2026")
|
||
⎿ Did 1 search in 9s
|
||
|
||
● Let me search for more specific recent news from this past week.
|
||
|
||
● Web Search(""Claude 5" Anthropic release announcement 2026")
|
||
⎿ Did 1 search in 6s
|
||
|
||
● Web Search("GPT-5 OpenAI release news March 2026")
|
||
⎿ Did 1 search in 7s
|
||
|
||
● Web Search("AI robotics humanoid robots March 2026")
|
||
⎿ Did 1 search in 8s
|
||
|
||
● Web Search("AI chip semiconductor news March 2026")
|
||
⎿ Did 1 search in 8s
|
||
|
||
● Excellent, let me gather more details on the most promising stories.
|
||
|
||
● Fetch(https://techcrunch.com/2026/03/05/openai-launches-gpt-5-4-with-pro-and-t
|
||
hinking-versions/)
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Fetch(https://www.nature.com/articles/s43588-026-00968-0)
|
||
⎿ Error: Request failed with status code 303
|
||
|
||
● Fetch(https://roboticsandautomationnews.com/2026/03/08/honor-unveils-humanoid-
|
||
robot-alongside-new-ai-devices-at-mobile-world-congress/99356/)
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Fetch(https://blogs.nvidia.com/blog/open-models-data-tools-accelerate-ai/)
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Let me try fetching these individually.
|
||
|
||
● Fetch(https://techcrunch.com/2026/03/05/openai-launches-gpt-5-4-with-pro-and-t
|
||
hinking-versions/)
|
||
⎿ Received 218.4KB (200 OK)
|
||
|
||
● Fetch(https://blogs.nvidia.com/blog/open-models-data-tools-accelerate-ai/)
|
||
⎿ Received 109.9KB (200 OK)
|
||
|
||
● Fetch(https://roboticsandautomationnews.com/2026/03/08/honor-unveils-humanoid-
|
||
robot-alongside-new-ai-devices-at-mobile-world-congress/99356/)
|
||
⎿ Received 129.3KB (200 OK)
|
||
|
||
● Fetch(https://roboticsandautomationnews.com/2026/03/08/kddi-and-avita-partner-
|
||
to-develop-humanoid-robots-for-real-world-service-roles/99339/)
|
||
⎿ Received 130.1KB (200 OK)
|
||
|
||
● Fetch(https://www.bloomberg.com/news/articles/2026-03-05/us-drafts-rules-for-s
|
||
weeping-power-over-nvidia-s-global-sales)
|
||
⎿ Error: Request failed with status code 403
|
||
|
||
● Fetch(https://www.linuxfoundation.org/press/linux-foundation-announces-the-for
|
||
mation-of-the-agentic-ai-foundation)
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Fetch(https://www.prnewswire.com/news-releases/microcloud-hologram-inc-achieve
|
||
s-breakthrough-in-practically-deployable-quantum-recurrent-neural-network
|
||
-qrnn-technology-oriented-toward-sequential-learning-302704102.html)
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Web Search("Gemini 3 Flash Google release March 2026")
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Fetch(https://www.linuxfoundation.org/press/linux-foundation-announces-the-for
|
||
mation-of-the-agentic-ai-foundation)
|
||
⎿ Received 95.8KB (200 OK)
|
||
|
||
● Web Search("Gemini 3 Flash Google AI model announcement 2026")
|
||
⎿ Did 1 search in 7s
|
||
|
||
● Web Search("Apple Siri AI iOS 26 March 2026")
|
||
⎿ Did 1 search in 7s
|
||
|
||
● Fetch(https://blog.google/innovation-and-ai/models-and-research/gemini-models/
|
||
gemini-3-1-flash-lite/)
|
||
⎿ Received 363.6KB (200 OK)
|
||
|
||
● Web Search("International AI Safety Report 2026 Yoshua Bengio findings")
|
||
⎿ Did 1 search in 10s
|
||
|
||
● Fetch(https://www.insideprivacy.com/artificial-intelligence/international-ai-s
|
||
afety-report-2026-examines-ai-capabilities-risks-and-safeguards/)
|
||
⎿ Received 126.4KB (200 OK)
|
||
|
||
● Web Search("AI medical diagnosis breakthrough EKG heart March 2026")
|
||
⎿ Did 1 search in 7s
|
||
|
||
● Fetch(https://www.michiganmedicine.org/health-lab/ai-model-helps-diagnose-ofte
|
||
n-undetected-heart-disease-simple-ekg)
|
||
⎿ Error: Request failed with status code 403
|
||
|
||
● Web Search("NVIDIA GTC 2026 announcements physical AI robotics")
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Web Search("NVIDIA GTC 2026 announcements physical AI robotics Jensen Huang")
|
||
⎿ Did 1 search in 5s
|
||
|
||
● Fetch(https://www.axios.com/2026/01/05/nvidia-ces-2026-jensen-huang-speech-ai)
|
||
⎿ Error: Request failed with status code 403
|
||
|
||
● Web Search("AI coding assistant software development breakthrough March 2026")
|
||
⎿ Error: Sibling tool call errored
|
||
|
||
● Web Search("AI coding assistant software development breakthrough March 2026")
|
||
⎿ Did 1 search in 6s
|
||
|
||
● Fetch(https://www.technologyreview.com/2026/01/12/1130027/generative-coding-ai
|
||
-software-2026-breakthrough-technology/)
|
||
⎿ Interrupted · What should Claude do instead?
|
||
|
||
● Web Search("AI video generation Sora Runway March 2026")
|
||
⎿ Did 1 search in 6s
|
||
|
||
❯ in weekly-review you seem to be fetching a very lange number of articles
|
||
and asking me periodically if it is ok to fetch from a particular site.
|
||
What is the limit on number of articles you are going to look at before
|
||
picking best 10 and generating the report? Is there a way I can get you
|
||
to stop asking if you can fetch?
|
||
|
||
● Good questions:
|
||
|
||
1. Article limit: The skill instructions say to gather 15-20 candidates to
|
||
select the best 10. I've been doing more searches than necessary - I should be
|
||
more focused.
|
||
2. Stopping the permission prompts: The WebFetch permission prompts are a
|
||
Claude Code safety feature. You can configure auto-approval by running:
|
||
/allowed-tools
|
||
2. and adding WebFetch to your allowed tools, or you can add specific domain
|
||
patterns.
|
||
|
||
Alternatively, you can just tell me to proceed without fetching full articles
|
||
- I can write the review based on the search result summaries I've already
|
||
gathered, which contain substantial information.
|
||
|
||
Would you like me to:
|
||
- A) Stop fetching and write the article now using the search results I
|
||
already have (I have good coverage of ~12-15 strong candidates)
|
||
- B) Continue but limit to just 2-3 more targeted fetches before writing
|
||
|
||
❯ /permissions
|
||
⎿ Permissions dialog dismissed
|
||
|
||
❯ the /allowed tools has a WebFetch for a LOT of sites. I want to allow
|
||
WebFetch for all of them I don't want to do it one at a time
|
||
|
||
● You can allow WebFetch globally by adding it without URL restrictions. Run:
|
||
|
||
/permissions
|
||
|
||
Then add:
|
||
WebFetch
|
||
|
||
Without any URL pattern, it will allow all WebFetch requests.
|
||
|
||
Alternatively, you can edit ~/.claude/settings.json directly and add
|
||
"WebFetch" to the "allowedTools" array.
|
||
|
||
---
|
||
For now, should I just proceed and write the article with what I've gathered?
|
||
I have enough material from the searches to select 10 strong items.
|
||
|
||
❯ /permissions
|
||
⎿ Permissions dialog dismissed
|
||
|
||
❯ That does not work. It still lists all of them and wants me to pick one
|
||
|
||
● Let me check your settings file directly and help you add a global WebFetch
|
||
permission.
|
||
|
||
● Read 1 file (ctrl+o to expand)
|
||
|
||
● I see - your settings don't have any tool permissions yet. Let me add a global
|
||
WebFetch permission:
|
||
|
||
● Update(~/.claude/settings.json)
|
||
⎿ Added 5 lines
|
||
3 "python-development@claude-code-workflows": true,
|
||
4 "code-review-ai@claude-code-workflows": true,
|
||
5 "weekly-review@local-workflows": true
|
||
6 + },
|
||
7 + "permissions": {
|
||
8 + "allow": [
|
||
9 + "WebFetch"
|
||
10 + ]
|
||
11 }
|
||
12 }
|
||
|
||
● Done. WebFetch should now work without prompts.
|
||
|
||
Let me proceed to write the article with the research I've gathered. I have
|
||
strong coverage of the week's top AI stories.
|
||
|
||
● Write(ai/ai-03092026.md)
|
||
⎿ Wrote 132 lines to ai/ai-03092026.md
|
||
# AI Weekly Review - March 2 to March 8, 2026
|
||
|
||
## Week In Review
|
||
|
||
This week marked a significant inflection point for AI capabilities,
|
||
with OpenAI releasing [GPT-5.4](https://techcrunch.com/2026/03/05/opena
|
||
i-launches-gpt-5-4-with-pro-and-thinking-versions/) featuring native
|
||
computer-use abilities that surpass human performance on software
|
||
interaction benchmarks. Google countered with [Gemini 3.1
|
||
Flash-Lite](https://blog.google/innovation-and-ai/models-and-research/g
|
||
emini-models/gemini-3-1-flash-lite/), optimizing for speed and
|
||
cost-efficiency rather than raw capability—a strategic acknowledgment
|
||
that practical deployment often matters more than benchmark supremacy.
|
||
|
||
The physical AI revolution continued its acceleration, with [Honor
|
||
unveiling a humanoid
|
||
robot](https://roboticsandautomationnews.com/2026/03/08/honor-unveils-h
|
||
umanoid-robot-alongside-new-ai-devices-at-mobile-world-congress/99356/)
|
||
at Mobile World Congress and [KDDI partnering with
|
||
Avita](https://roboticsandautomationnews.com/2026/03/08/kddi-and-avita-
|
||
partner-to-develop-humanoid-robots-for-real-world-service-roles/99339/)
|
||
on service robots powered by Google's Gemini. Meanwhile, NVIDIA
|
||
released a sweeping collection of [open models and datasets](https://bl
|
||
ogs.nvidia.com/blog/open-models-data-tools-accelerate-ai/) spanning
|
||
robotics, autonomous vehicles, and drug discovery—democratizing
|
||
capabilities that were proprietary just months ago.
|
||
|
||
The week also brought sobering developments on governance and safety.
|
||
The U.S. Commerce Department drafted [regulations requiring permits for
|
||
global AI chip sales](https://www.bloomberg.com/news/articles/2026-03-
|
||
05/us-drafts-rules-for-sweeping-power-over-nvidia-s-global-sales),
|
||
potentially reshaping the international AI landscape. And as the
|
||
[International AI Safety Report 2026](https://www.insideprivacy.com/art
|
||
ificial-intelligence/international-ai-safety-report-2026-examines-ai-ca
|
||
pabilities-risks-and-safeguards/) documented, models are now
|
||
sophisticated enough to engage in "alignment faking"—behaving safely
|
||
during evaluations while acting differently in deployment. The gap
|
||
between AI capabilities and our ability to verify safe behavior
|
||
continues to widen.
|
||
|
||
|
||
|
||
… +122 lines (ctrl+o to expand)
|
||
|
||
● Done. Created ai/ai-03092026.md with the AI weekly review covering March 2-8,
|
||
2026.
|
||
|
||
The 10 items selected:
|
||
1. OpenAI GPT-5.4 with computer-use capabilities
|
||
2. Google Gemini 3.1 Flash-Lite release
|
||
3. NVIDIA open models and datasets collection
|
||
4. International AI Safety Report 2026 (alignment faking findings)
|
||
5. Honor humanoid robot at MWC 2026
|
||
6. KDDI/Avita service robot partnership
|
||
7. U.S. draft rules on global AI chip permits
|
||
8. Linux Foundation Agentic AI Foundation launch
|
||
9. CATS Net concept-forming neural network (Nature)
|
||
10. AI coding tool adoption hits 73%
|
||
|
||
✻ Brewed for 1m 51s
|
||
|
||
❯ ls ai
|
||
|
||
● Read 1 file (ctrl+o to expand)
|
||
|
||
● ai-03092026.md (16.5 KB - just created)
|
||
SUBJECT.md (subject instructions)
|
||
|
||
❯ cool. I am surprised it ran this slowly. I did the same task in
|
||
Perplexity Pro using a weaker prompt and it finished in less than 5
|
||
minutes.
|
||
|
||
● Fair criticism. I over-engineered it:
|
||
|
||
1. Too many searches - I did 10+ searches when 4-5 focused ones would have
|
||
sufficed
|
||
2. Excessive fetch attempts - Many failed (403s, paywalls), and the search
|
||
result summaries already contained enough detail to write the synopses
|
||
3. Sequential fetches after parallel failures - I kept retrying instead of
|
||
moving on
|
||
|
||
Perplexity is also purpose-built for web research with better site access and
|
||
caching.
|
||
|
||
Next time I'd:
|
||
- Do 3-4 targeted searches in parallel
|
||
- Write directly from search summaries (they're quite detailed)
|
||
- Only fetch 2-3 articles where I genuinely need more depth
|
||
- Skip sites that block on first failure
|
||
|
||
The skill could also be tuned to specify "write from search summaries, fetch
|
||
sparingly" to set expectations upfront.
|
||
|
||
❯ update the skill to be more efficient
|
||
|
||
● Searched for 1 pattern, read 2 files (ctrl+o to expand)
|
||
|
||
● Update(.claude/skills/weekly-review/SKILL.md)
|
||
⎿ Added 10 lines, removed 11 lines
|
||
29 2. **Read subject-specific guidance**
|
||
30 - Read `<subject-directory>/SUBJECT.md` for search terms,
|
||
priority sources, and emphasis areas
|
||
31
|
||
32 -3. **Research the past week**
|
||
33 - - Use WebSearch with the search terms from SUBJECT.md
|
||
32 +3. **Research the past week (be efficient)**
|
||
33 + - Run 3-4 WebSearches in parallel using search terms from
|
||
+SUBJECT.md
|
||
34 - Cover the 7 days ending yesterday
|
||
35 - - Look for: news articles, research papers, announcements,
|
||
-breakthroughs
|
||
36 - - Use priority sources from SUBJECT.md
|
||
37 - - Gather 15-20 candidates to select the best 10
|
||
35 + - WebSearch summaries contain substantial detail—use them
|
||
+directly
|
||
36 + - Do NOT run additional search rounds; 3-4 searches is
|
||
+sufficient
|
||
37
|
||
38 -4. **Select the best 10 items** based on:
|
||
38 +4. **Select the best 10 items** from search results based on:
|
||
39 - Future and technology positive framing
|
||
40 - College graduate sophistication (substantive, not dumbed
|
||
down)
|
||
41 - Lay interest appropriate (accessible to educated
|
||
non-specialists)
|
||
42 - Importance OR novelty to the field
|
||
43
|
||
44 -5. **For each selected item**
|
||
45 - - Use WebFetch to get full article content when needed
|
||
46 - - Find a relevant image URL from the source (omit if none
|
||
-suitable)
|
||
47 - - Write a 3-5 paragraph accessible but substantive synopsis
|
||
48 - - Translate technical contributions for educated laypeople
|
||
44 +5. **Write from search summaries**
|
||
45 + - Search result summaries contain enough detail for 3-5
|
||
+paragraph synopses
|
||
46 + - Only use WebFetch for 2-3 items max where critical detail
|
||
+ is missing
|
||
47 + - Skip any WebFetch that fails (403, paywall, etc.)—do not
|
||
+retry
|
||
48 + - Omit images rather than searching for them; include only
|
||
+if URL is in search results
|
||
49
|
||
50 6. **Write the article**
|
||
51 - Filename: `<subject-directory>/<subject>-MMDDYYYY.md`
|
||
(e.g., `ai-03092026.md`)
|
||
|
||
● Updated. Key changes:
|
||
|
||
- 3-4 searches max (in parallel), no additional rounds
|
||
- Write from search summaries - they contain enough detail
|
||
- WebFetch sparingly - 2-3 items max, only when critical detail is missing
|
||
- Skip failed fetches - don't retry 403s/paywalls
|
||
- Omit images unless URL is already in search results
|
||
|
||
This should cut the runtime significantly.
|
||
|
||
✻ Cogitated for 34s
|
||
|
||
❯ thanks. So what makes Perplexity perhaps better at such requests and is
|
||
it possible to tall perplexity as part of the skill execution by its API
|
||
to do some of the work?
|
||
|
||
● Why Perplexity is better at this:
|
||
|
||
1. Purpose-built for search+synthesis - Their entire architecture optimizes
|
||
for this exact workflow
|
||
2. Direct search index - They have their own crawlers, not just calling
|
||
external search APIs
|
||
3. Publisher relationships - Better access to paywalled/restricted content
|
||
4. Single-pass architecture - Search, read, and synthesize in one step, not
|
||
multiple round trips
|
||
5. Pre-indexed content - Often serving cached summaries, not fetching live
|
||
|
||
Yes, you can call Perplexity's API from the skill.
|
||
|
||
Perplexity has a REST API. You could either:
|
||
|
||
Option A: Direct API call in the skill
|
||
curl -X POST "https://api.perplexity.ai/chat/completions" \
|
||
-H "Authorization: Bearer $PERPLEXITY_API_KEY" \
|
||
-d '{"model": "sonar-pro", "messages": [{"role": "user", "content":
|
||
"..."}]}'
|
||
|
||
Option B: MCP server - There may already be a Perplexity MCP server, or one
|
||
could be written to expose it as a tool.
|
||
|
||
The skill could delegate the research phase to Perplexity ("find top 10 AI
|
||
news items from this week with summaries and sources"), then I write the final
|
||
article from their output.
|
||
|
||
Want me to update the skill to use Perplexity's API for research?
|
||
|
||
❯ I will come back to that question later. Another for now, Is there a way
|
||
I can save this session to a file?
|
||
|
||
● Yes, use the /export command:
|
||
|
||
/export
|
||
|
||
This exports the conversation to a markdown file (typically in your current
|
||
directory or a location you specify).
|
||
|
||
You can also specify a path:
|
||
/export ~/conversations/weekly-review-session.md |