Fork me on GitHub

Project Notes

#422 TRELLIS.2

About TRELLIS.2, an open-source image to 3D generation model. TLDR - very disappointing results from my initial tests.

Notes

TRELLIS.2 is an open-source 4B-parameter image-to-3D model producing up to 1536 PBR textured assets, built on native 3D VAEs with 16× spatial compression, delivering efficient, scalable, high-fidelity asset generation.

A ran a few tests using the demo hosted at huggingface, but am not getting great results.

Line Drawing Trial

Image generated with ChatGPT:

create a black and white outline drawing of a taiko drummer, in dramatic pose holding bachi. Only the figure, on a clear background without taiko or other elements

example-1

Not a bad image, but note the legs are not complete. This could be a common challenge for 3D model generators, given incomplete source material.

Let’s see how TRELLIS.2 performs, using the demo hosted at huggingface. I used default settings all the way:

generate-1

I exported the 14.1MB GLB file and loaded into blender:

blender-1

Evaluating the results:

  • It wasn’t smart enough to complete the missing leg detail
  • Hidden hand details are poorly completed anatomically incorrect
  • Generated some random “floating” elements

In all, a decent start, but would require a lot of cleanup to be a usable 3D model.

Photo Trial

I started with this image from https://southwestfolklife.org/ken-koshio-taiko-player/

example-1

I generated a model using default settings using the demo hosted at huggingface:

generate-1

Evaluating the results:

  • useless; very disappointing
  • clearly failed to properly comprehend the geometry of the source

Credits and References

About LCK#422
AI

This page is a web-friendly rendering of my project notes shared in the LittleCodingKata GitHub repository.

Project Source on GitHub Return to the LittleCodingKata Catalog
About LittleCodingKata

LittleCodingKata is my collection of programming exercises, research and code toys broadly spanning things that relate to programming and software development (languages, frameworks and tools).

These range from the trivial to the complex and serious. Many are inspired by existing work and I'll note credits and references where applicable. The focus is quite scattered, as I variously work on things new and important in the moment, or go back to revisit things from the past.

This is primarily a personal collection for my own edification and learning, but anyone who stumbles by is welcome to borrow, steal or reference the work here. And if you spot errors or issues I'd really appreciate some feedback - create an issue, send me an email or even send a pull-request.

Follow the Blog follow projects and notes as they are published in your favourite feed reader