about native ai models since 2024 (eg india summit ambani -huang)

nation braning eg my publications since 1989!!

deep trust mappaing/network/algoriths/brain influencing - neumaan 1956+ (booklet computer and the brain)

nvidi training blog

Agentic AI / Generative AI

Develop Native Multimodal Agents with Qwen3.5 VLM Using NVIDIA GPU-Accelerated Endpoints


Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this series is a ~400B parameter native vision-language model (VLM) with reasoning built with a hybrid architecture of mixture of experts (MoE) and Gated Delta Networks. Qwen3.5 can understand and navigate user interfaces, which improves on the previous generation of VLMs. 

Qwen3.5 is ideal for a variety of use cases, including:

  • Coding, including web development
  • Visual reasoning, including mobile and web interfaces
  • Chat applications
  • Complex search

Qwen3.5
Modalities Vision, language
Total parameters 397B
Active parameters 17B
Activation rate 4.28%
Input context length 256K extensible to 1M tokens
Languages supported 200+
Additional configuration information
Experts 512
Shared experts 1
Experts per token 11 (10 routed + 1 shared)
Layers 60
Words (vocabulary) 248,320
Table 1. Specifications and configuration details for the Qwen3.5 model

Build with NVIDIA endpoints

You can start building with Qwen3.5 today with free access to GPU-accelerated endpoints on build.nvidia.com, powered by NVIDIA Blackwell GPUs. As part of the NVIDIA Developer Program, you can explore quickly in the browser, experiment with prompts, and even test the model with your own data to evaluate real-world performance.


Video 1. Learn how to you can test Qwen3.5 on NVIDIA GPU-accelerated endpoints

You can also use the NVIDIA-hosted model through the API, free with registration in the NVIDIA Developer Program.  

import requests
   
   
headers = {
    "Authorization": "Bearer $NVIDIA_API_KEY",
    "Accept": "application/json",
}
   
payload = {
  "messages": [
    {
    "role": "user",
    "content": ""
    }
  ],
  "model": "qwen/qwen3.5-397b-a17b",
  "chat_template_kwargs": {
    "thinking": True
  },
  "frequency_penalty": 0,
  "max_tokens": 16384,
  "presence_penalty": 0,
  "stream": True,
  "temperature": 1,
  "top_p": 1
}
   
# re-use connections
session = requests.Session()
   
response = session.post(invoke_url, headers=headers, json=payload)
   
response.raise_for_status()
response_body = response.json()
print(response_body)

To take advantage of tool calling, simply define an array of OpenAI compatible tools to add to the chat completions tools parameter.

NVIDIA NIM makes it easy to take Qwen3.5 from development into production. Available as optimized, containerized inference microservices, NIM packages the model with the performance tuning, standardized APIs, and deployment flexibility enterprises need. Download and run it anywhere; on-premises, in the cloud, or across hybrid environments.

Customize with NVIDIA NeMo  

While Qwen3.5 offers impressive “out-of-the-box” multimodal capabilities, the NVIDIA NeMo framework provides the essential tools to adapt it for specialized domain needs. Using the NeMo Automodel library, developers can fine-tune the Qwen3.5 397B-parameter architecture with high-throughput efficiency.

NeMo Automodel is a PyTorch-native training library that offers Day 0 Hugging Face support, enabling direct training on existing checkpoints without tedious model conversions. This facilitates rapid experimentation, whether performing full supervised fine-tuning (SFT) or using memory-efficient methods such as LoRA.

As a reference implementation guide, developers can leverage the technical tutorial on Medical Visual QA, which details how to fine-tune Qwen3.5 on radiological datasets. For massive scale, NeMo supports multinode Slurm and Kubernetes deployments, ensuring that even the largest MoE models are optimized for domain-specific reasoning and complex agentic workflows with minimal latency.

Get started with Qwen3.5 

From data center deployments on NVIDIA Blackwell to NVIDIA NIM microservice for containerized deployment anywhere, NVIDIA offers solutions for your integration of Qwen3.5. To get started, check out the Qwen3.5 model page on Hugging Face and test Qwen3.5 on build.nvidia.com.

 Discuss (0)
+15

Tags

Start the discussion at forums.developer.nvidia.com

Views: 14

Reply to This

WHAT's DATA SOVEREIGNTY & WHAT CAN INTELLIGENCE DO? Today engineers can help peoples of any place be comparatively best at what their place on earth offers to generate. For example beautiful island might wam to be a toursist destination but overtime it (eg Galapagos) might want to develop intergenerational friendships so its teenagers can connect goodwill around the world as well as any skills eg medical or green energy the island most urgently need. Generations ago, Singapore did something different; its 6 million person poluation saw itself as at the cross-seas of world's first superport. It also gave back to region asean encouraging celebration of every peoples cultures and arts. It has aimed to be the 21st C most intelligent isle- where education is transformed by every 2nd grade teacher being as curious about what will ai do over the next 5 years as anyone else. Taiwan, addmitedly a 20 million person island, chose 1987 to become world number 1 as chip design changed to maximise customer requirements instead of the moores law era where at most one new chip a year would be designed in line with Intel's 3 decades of promising 100 times more capacity every decade.

In 2025, the vibrant aAInations index is one way of looking at where is place being led to maximise its peoples intelligence opportunities for evryone to win-win (network entreprenurially)

Happy 2025- free offer first quarter of 2025 - ask us any positive question about von neumann's purpose of intelligence/brainworking - by April we hope there will be a smart agent of neumann! - chris.macrae@yahoo.co.uk

Maths-Lab-Crisis.docx

Joun in perplexity chats 

Does AI have name for terrifying ignorance rsks eg Los Angeles failed insurance sharing

In these days of LLM modeling, is there one integral one for multilateral systems reponsibilities

Is Ethiopia's new secirity model an Africawide benchmark

can you hlep map womens deepest  intel nets

what can you tell us about ...


thanks to JvN

2025report.com aims to celebrate first 75 years that followers of Adam Smith , Commonwealth begun by Queen Victoria, James Wilson and dozens of Royal Societies, Keynes saw from being briefed 1951 by NET (Neumann Einstein Turing). Please contacts us if you have a positive contribution - we will log these at www.economistdiary.com/1976 www.economistdiary.com/2001 and www.economistdiary.com/2023 (admittedly a preview!!)

First a summary of what the NET asked to be meidiated to integrate trust during what they foresaw as a chaotic period.

Roughly they foresaw population growth quadrupling from 2 billion to 8 billion

They were most concerned that some people would access million times moore tech by 1995 another million times moore by 2015 another million times moore by 2025. Would those with such access unite good for all. If we go back to 1760s first decade that scots invented engines around Glash=gow University James Wat and diarist Adam Smith we can note this happened just over a quarter of millennium into age of empire. WE welcome corrections be this age appears to have been a hectic race between Portugal, Spain, France Britain Netherlands as probbly the first 5 to set the system pattern. I still dont understand was it ineviatble when say the Porttuguese king bet his nations shirt on navigation that this would involve agressive trades with guns forcing the terms of trade and colonisation often being a 2nd step and then a 3rd steb being taking slaves to do the work of building on a newly conquered land. I put this way because the NET were clear almost every place in 1951 needed to complete both independence and then interdependence of above zero sum trading games. Whils traidning things runs into zero sums (eg when there is overall scarcity) life critical knowhow or apps can multiplu=y value in use. Thats was a defining value in meidting how the neyt's new engineering was mapped. Of course this problem was from 1945 occuring in a world where war had typiclly done of the following to your place:

your capital cities had been flattened by bombing - necessitating architecture rebuild as well as perhaps an all chnage in land ownership

your peoples had gone through up to 6 years of barbaric occupation -how would this be mediated (public served) particularly if you were a nation moving from radio to television

yiu mifgt eb britain have been on winning side but if huge debt to arms you had bought

primarily you might be usa now expected by most outside USSR to lead every advance'

in population terms you might be inland rural (more than half of humans) where you had much the least knowledge on what had hapened because you had been left out of the era of connecting electricity and communications grids

The NETts overall summary : beware experts in energy will be the most hated but wanted by national leaders; and then far greater will be exponential risk is the most brilliant of connectors of our new engines will become even more hated and wanted. We should remember that the NET did not begin with lets design computers. They began with Einstein's 1905 publications; newtonian science is at the deepest limits systemically wrong for living with nature's rules.

WE can thrash through more understanding of how the NET mapped the challenges from 1951 at https://neumann.ning.com/ Unfortunatnely nobody knew that within 6 years of going massively public in 1951 with their new engineering visions, all of the net would be dead. One of the most amzaing documents I have ever seen is the last month's diary of von neumann roughly October 1955 before he became bedridden with cancer. All over usa engineering projects were receiving his last genius inputs. And yet more amazing for those interested in intelligence machines is his last curriculum the computer and the brain scribbled from his bedroom in bethesda and presented posthumously by his 2nd wife Klara at Yale 1957 before she took her own life about a year later. A great loss because while neumann had architected computers she had arguably been the chief coder. Just to be clear Turing also left behind a chief coder Jane who continued to work for Britain's defence planning at cheltenham for a couple of decades. Economistwomen.com  I like to believe that the founders of brainworking machines foresaw not only that women coders would be as produytive as men but that they would linking sustainability from bottom up of every community. At least that is a valid way of looking at how primarily 1billion asian women batted the systemic poverty of being disconnected from the outside world even as coastal places leapt ahead with in some cases (G Silicon Valley, whatever you call Japan-Korea south-Taiwan-HK-Singapore access to all of 10**18 times moore

Epoch changing Guides

1 AI Training AI Training.docx

 2 Exploring cultural weaknesss of encounters with greatest brain tool.docx

.2016-23.pptx

help assemble 100000 millennials summitfuture.com and GAMES of  worldrecordjobs.com card pack 1 i lets leap froward from cop26 glasgow nov 2021 - 260th year of machines and humans started up by smith and watt- chris.macrae@yahoo.co.uk-

WE APPROACH 65th year of  Neumann's tech legacy - 100 times more tech decade - which some people call Industrial Rev 4 or Arttificial Intel blending with humans; co-author 2025report.com, networker foundation of The Economist's Norman Macrae -

my father The Economist's norman macrae was privileged to meet von neumann- his legacy of 100 times more tech per decade informed much of dad's dialogues with world leaders at The Economist - in active retirement dad's first project to be von neumanns official biographer - english edition ; recently published japanese edition - queries welcomed; in 1984 i co-authored 2025report.com - this was celebrating 12 th year that dad( from 1972, also year silicon valley was born) argued for entrepreneurial revolution (ie humanity to be sustainable would need to value on sme networks not big corporate nor big gov); final edition of 2025report is being updated - 1984's timelines foresaw need to prep for fall of brlin wall within a few months; purspoes of the 5 primary sdg markets were seen to be pivotal as they blended real and digital - ie efinance e-agri e-health e-learning and 100%lives matter community; the report charged public broadcasters starting with BBC with most vital challenge- by year 2000 ensure billions of people were debating man's biggest risk as discrepancy in incomes and expectations of rich & poor nations; mediated at the right time everyone could linkin ideas as first main use of digital webs--- the failure to do this has led to fake media, failures to encourage younger half of the world to maxinise borderless friendships and sdg collabs - see eg economistwomen.com abedmooc.com teachforsdgs.com ecop26.com as 2020s becomes last chance for youth to be teh sustainability generation


 

© 2026   Created by chris macrae.   Powered by

Report an Issue  |  Terms of Service