Projects

Unifying Textual Prompt and Reference Image for Context-Preserving Object Insertion

Given a background, a prompt and a reference image of the object, insert the object into the background at a reasonable location while preserving visual consistency.

Enhancing LLM’s Coding Ability by Tree-Based Searching Methods

We implemented MCTS and Tree of Thought to guide code synthesis based on execution feedback.

Controllable Image Generation and Artistic Style Transfer

We implemented Dreambooth-LoRA and Text Inversion for style transfer based on Stable Diffusion v2.1.

Self-Certainty Guided Test-Time Scaling for Web Agents

We implemented a self-certainty guided R-MCTS method for web agents.

Ktransfer: Exploring Knowledge-Agnostic Prompt Domains for Cross-Domain Question Answering

We built and trained retrievers across knowledge-agnostic domains for multi-choice question answering.