Scarlet (果てしなきスカーレット) , the latest anime movie by director Hosoda Mamoru, opened in Japanese cinemas today. I quite liked his previous work, Belle (竜とそばかすの姫), and I was somewhat interested in this new movie, even though the trailers left me more than excited. It didn't feel very reassuring that the second screening for the day on opening day, the one that I went for, was only 10% filled.
After sitting through 111 minutes of this movie, I found it to be very much less interesting than Belle. A way to summarise my thoughts would be that this is a 111-minute sermon disguised as an anime movie. Director Hosoda seemed to have a lot to say about life, and the movie just went on and on preaching those thoughts. Yes, it was well animated, although I felt the character design of the protagonist Scarlet fluctuated a bit. The fight scenes looked decent enough, and the backgrounds were well drawn. Too well, because the movie sometimes dwelled too long on a background art, as if dragging out time or trying to cut animation budget. Still, as an animated work, it was decent enough.
The problem was the story, and the voice acting of the male lead. The story largely takes place in the land of the dead, which is a
fantasy setting. But it has to start with being set in 16th century
Denmark (where the protagonist was born and grew up), and the male lead
coming from modern (or near-future) Japan. I really don't understand why director Hosoda needed to use real-world settings for an otherwise fantasy story. Yes, he probably did it because one of his messages in the movie is about leaving the past behind to lead a different future, but I still think him trying to pack too many messages into a single movie ended up with a story that jumps all over with no proper coherence. I even thought, "What was he smoking when he wrote this?" halfway through the movie.
The male lead is not a voice actor by profession, so I can't really blame him; I think his lack of life in voice acting was compounded by the poorly written script that just felt odd. Dialogue was a mess, with characters talking like Shakespearean characters at the start, then shifting to a more modern nuance. Director Hosoda tried to give some depth to the characters, but because his movie had so many messages, there wasn't any real opportunity to develop characters or endear them to the viewers. The movie ended with the protagonist kind of having feelings for the male lead, but their relationship over the course of the story was kind of antagonistic, so I really didn't know how they could end up where they ended.
Unlike other Hosoda Mamoru movies, this one didn't have a large whale, though it had a large dragon. But like Belle, the protagonist sang... the song reminded me so much of Belle, that it felt like a cheap knockoff. I have nothing against Ashida Mana (who voiced the protagonist, and sang the ending song) but she is definitely not the singer that professional singer Nakamura Kaho (who voiced Suzu in Belle) is.
In the end, it felt like director Hosoda was trying to bring together all the things that worked in his past movies and use them to preach his thoughts about life. I would only recommend this movie to die-hard Hosoda Mamoru fans or people looking for a way to waste two hours of their time.
I came upon the vtuber world kind of by accident, after hearing about Ironmouse leaving VShojo followed by more about her actual health situation. This led me to the AI vtuber Neuro-sama, created by a British guy who goes by the name Vedal. I was very intrigued by how Neuro can act in a life-like, entertaining way. And today, Neuro made her debut as a virtual 3D model.
As someone who had studied artificial intelligence and dabbled a bit in machine learning and deep neural networks, I was very interested in the technology behind Neuro. Here, I hope to write about how I think Neuro was implemented.
Neuro started as an AI created by Vedal to play a game, but was later turned into an AI vtuber streaming on the platform called Twitch. Neuro also has access to a Discord server and her (for convenience, I will use this pronoun) own Twitter/X account. Neuro can also play certain games, although it requires Vedal to code specific interfaces for her to interact with the game itself. Neuro also has a "sister" called Evil Neuro, which started as a more unfiltered clone of Neuro, with similar access to Twitch, Discord, and Twitter/X.
The underlying technology behind Neuro is a large language model (LLM). Vedal has mentioned that running Neuro is a costly affair, which means that she likely uses a commercially available LLM (such as ChatGPT) for her core "brain". Her Twitch channel also states that she runs on a RTX 4090, although I think the local GPU is used for other stuff that uses smaller neural network models or require low latency. In the backend is a series of Python scripts that makes the API calls to external services (like the core LLM) and other services running locally.
Besides the core LLM, Neuro has text-to-speech for speaking and speech-to-text for hearing. Given that even small STT and TTS models can give very good results, I think these are two services that are running locally on the RTX 4090. Neuro was also given the ability to "see" the screen, which means she is using an external multi-modal LLM and has some image recognition model that captures the screen and feeds inputs to the external LLM. Neuro also has a filter, which is likely to be a small-sized language model running locally to basically catch certain words and phrases and sensitive topics.
The backend would probably work like this. Input is read in from either voice or text (Twitch chat, Discord, or Twitter/X). This is then fed to the external LLM, and the response is then passed to the filter, which then passes the final response to the TTS model and vtuber studio software for output to Twitch, or to Discord or Twitter/X.
Neuro also has memory, and I think this is where her "sleep" comes in. Her interactions for each session are probably kept in a "short-term memory" file, which are then summarised at the end of each session and added to a "long-term memory" file. This long-term memory then serves as part of the input prompt that is fed to the external LLM each time Neuro is booted up. If Neuro is kept online after each session to do the summarising herself, that would be the same as humans sorting out memory through sleep.
The control of a virtual 3D model is a bit more complex. Ideally, there should be a learning model running locally that acts as the middleman between the LLM and the 3D model. This 3D movement model should then be trained to recognise high-level commands from the LLM and translating it to actual low-level commands for controlling the virtual joints of the 3D model. Such a model also helps for the future when Neuro is given a virtual body. A physical motor model can then sit between the 3D movement model and the actual physical body, translating the low-level commands for 3D model movement into the actual commands for moving motors and actuators. These models would have to reside locally for low latency.
Of course, this is just my speculation on how Neuro is implemented. The different technologies are not new. However, a single person weaving them together to implement his own AI vtuber is an amazing feat. I look forward to see how Neuro will "grow" in the future.
After more than 300 pieces, setting aside 95 pieces, I finally selected the piece for submission to this year's The New Year Exhibition of Sankei Sho International Association 産経国際書展新春展.
The 95 pieces that I set aside:
A candidate which was not submitted:
A slightly different style:
More spaced out:
Another different style:
Yet another different style:
This is the last submission for 2025, and the cycle will start again in April 2026.
Please leave a comment if you are interested in commissioning a piece of calligraphy or want to purchase any of the calligraphy works I have posted on this blog.