Apr 10, 2024
2241 words
Booting PostMarketOS

After getting rid of the crappy UI and installing LineageOS on my Redmi Phone, I was still not satisfied. Mainly because I am using WeChat too frequently, and it is kinda addictive and time-wasting. Those

Today I decided to play around with the BSD distros. I have used many Linux distros and found they are pretty much alike, no big difference anyways except for package managers. Anyways, I hopped on

Recently, the idea of self-hosting was so intriguing that I decided to code a full-stack video platform. There are many Object Storage providers, including Amazon AWS, Akamai, Digital Ocean, Cloudflare, Alibaba Cloud. Nearly all of

Mar 09, 2024
1968 words
Booting LineageOS(Finally)

I've failed countless times while experimenting with different phones and operating systems. Unlike computers, which have a universal architecture, each phone possesses its unique structure. This makes installing operating systems on them exceedingly challenging. Furthermore,

Nov 29, 2023
452 words
Offline RL

Note: All Yaml files are in the git repo: https://github.com/jimchen2/cs285-reinforcement-learning python cs285/scripts/run_hw5_explore.py \ python cs285/scripts/run_hw5_explore.py \ python cs285/scripts/run_hw5_explore.py \ The Random Network Distillation algorithm encourages exploration by training another neural network to approximate the output

Nov 24, 2023
521 words
Q Learning and SAC

Compute Action and use epsilon greedy action = torch.tensor(random.randint(0, self.num_actions - 1)) action = self.critic(observation).argmax(dim=1) Step environment Add data to replay buffer replay_buffer.insert(...) Sample from replay buffer batch = replay_buffer.sample(config["batch_size"]) Train agent, we update the

Nov 20, 2023
872 words
Policy Gradients

There are 2 kinds of estimator for Policy Gradients, full trajectory and and "reward-to-go" We run the two configs on Cartpole with different parameters, specifically, rtg means reward to go, na means normalizing the advantages.

Nov 18, 2023
371 words
Imitation Learning with DAgger

We run imitation learning and Dagger based on expert policies. In this experiment the expert policy is directly sampled out from a trained Neural Network, so Dagger differs from real world applications in that it

Oct 21, 2023
864 words
Setting Up Raspberry Pi OS

Setting up and configuring Raspberry Pi OS can be a streamlined process by using chroot on a different machine. This guide will walk through the steps to prepare your Raspberry Pi SD card on a

Oct 14, 2023
584 words
Grub Bootloader and LFS

So grub is basically a bootloader to start the system. The point here is I've been using grub for almost a year now without a deep understanding. Grub boots the system by using grub shell.