Never Supervised

Intentional Assimilation

2024-11-17T08:01:02+00:00

The paper Does the “Melting Pot” Still Melt? Internet and Immigrants’ Integration aligns with feedback I’ve often shared with immigrants who come to the U.S. for college or early in their careers.

I came to America in pursuit of opportunities that weren’t available to me in Argentina. These opportunities are not a fluke but a result of intrinsic properties of America, including its culture.

Culture and values tend to cluster. If we independently develop a value and then find a group that strongly expresses that value, there will be an expectation that we adopt a substantial fraction of their doctrine to be accepted into the group. This is why we find strong correlations across party lines, such as liberals being more likely to be vegetarian and professors being less likely to be conservative.

At first blush, this seems oppressive, and in recent years (writing this in 2024), it has led to unnecessary infighting within organizations that otherwise share a common objective. However, because science struggles to determine causality, it’s not clear whether any of these correlations are actually meaningful. So, if our goal is to emulate a certain group and achieve similar characteristics (wealth, education, happiness), we must be careful when cherry-picking values and behaviors.

Which brings us to being an immigrant. If we aspire to become representative individuals, the simplest behavior is to assimilate. This term might evoke unpleasant historical precedents, such as immigrants being persecuted for maintaining their religion. I’m referring to a more charitable interpretation of assimilation—one that’s not about forgetting who we are or what we care about, but about learning and adapting to a new environment.

I think it’s fairly uncontroversial to say that learning English improves our chances of success in America. Furthermore, the better our grammar, vocabulary, and pronunciation, the higher our chances of success. But there are more nuanced facets to this assimilation. Consider communication styles in professional settings. Argentina is far more informal, with important business communication often happening over unscheduled phone calls or interminable WhatsApp voice messages. In the U.S., written communication and calendar scheduling are emphasized. So, one could develop perfect language proficiency and still be perceived as unprofessional.

The nuance doesn’t end there. In fact, communication alone is an endless rabbit hole of behavioral cues that take time to learn. Anything from giving a wedding toast to succeeding in a professional interview fits inside an Overton window. This means that assimilation requires significant learning and adjustment. It takes time and becomes harder as we age, so it’s imperative to dedicate substantial energy to the process.

Unfortunately, immigration can be a lonely journey. Many recent immigrants fantasize about one day returning home. We miss our food, entertainment, and friends. So we read foreign news, actively participate in chat groups and social media, and keep up with foreign films and TV. We might also find a group of expats and spend much of our social time interacting within a microcosm of our culture and language of origin.

It’s reasonable to think socialization or personal relaxation is unrelated to professional development, but that would be a mistake. Reading local news, watching popular movies (with English subtitles), and having casual conversations with new friends or on dates are essential exercises. I bet a research study would convincingly show that faster assimilation leads to higher success rates across key metrics like wealth, happiness, and longevity.

This is hard feedback to give. It’s uncomfortable to tell someone they should consider stepping back from their past to embrace their new environment. It’s one of those topics where we carefully choose our words and watch the other person’s involuntary response. But it’s important, practical advice I share whenever a new immigrant asks for guidance.

Abstract: The global spread of the Internet and the rising salience of immigration are two of the biggest trends of the last decades. And yet, the effects of new digital technologies on immigrants—their social integration, spatial segregation, and economic outcomes—remain unknown. This paper addresses this gap: it shows how home-country Internet expansion affects immigrants’ socio-economic integration in the U.S. Using DID and event-study methods, I find that home-country Internet expansion lowers immigrants’ linguistic proficiency, naturalization rates, and economic integration. The effect is driven by younger and less educated immigrants. However, home-country Internet also decreases spatial and occupational segregation and increases immigrants’ subjective well-being. Time-use data suggests that the Internet’s impact on immigrants’ networking is part of the story. I also show the role of return intentions and Facebook usage, among other factors. These findings align with a Roy model of migration augmented with a choice between host- and home-country ties. Overall, this paper shows how digital technologies transform the immigration, diversity, and social cohesion nexus.

Link to paper

Sparcity is Entropy to Innovation

2024-10-31T08:01:02+00:00

Most companies in our portfolio are distributed to some degree, and I believe remote teams are here to stay. But founders should recognize the inherent challenges and take measures to mitigate them.

In recent years, I’ve observed a religious fervor in favor of remote work, particularly in tech. There’s plenty of thought leadership making persuasive arguments in favor of distributed teams. Opposing views, such as Amazon’s return to office, are usually met with outrage. I believe this topic is more nuanced, particularly in the context of nurturing innovation.

Conway’s Law tells us that systems will be shaped by the underlying organizational structure. This idea was first discussed in 1967, when the constraints were captured by nodes and edges in an org-chart. Today we are operating in a multi-dimensional space where, in addition to reporting structure, individuals can be distanced by geography, time zone, language, cost (flights, hotels), and so on. Even for a particular tolerance of “distributedness,” the precise organizational topology matters: distance between teammates is worse than that between densely clustered teams.

When I engage in some friendly sparring online, the same flawed assumption bubbles up: that innovation in technology is primarily transactional, a simple matter of assigning tasks and waiting for results. I’ve heard countless perspectives about how people “just get their stuff done” faster from home, or how office presence is merely performative. Some brag about multiple full-time remote positions, presenting it as a way to fight the oppression of corporate employers. A manager can signal virtue with claims of “not caring as long as the assigned work is completed.”

But “getting your work done” misses the essence of innovation in technology. There is never a master plan that simply needs execution. The trajectory of a company emerges from the creative chaos in between top-down vision and trench warfare. In technology, we don’t pay for completed tasks – we pay for ideas, judgment, and intuition. Anyone can have a curve-bending epiphany, nourished by seemingly casual conversations that happen more naturally in person. The conversations that happen over beers or weekend BBQs spawn ideas that no amount of Zoom calls could replicate. It isn’t about mandatory fun or forced socialization – it’s about the natural exchange of thoughts when barriers drop.

I’ve seen this dynamic play out in what I call the “black market of favors” – informal collaborations that arise between colleagues who have built real relationships. As a developer, I often found myself prototyping features quickly for product managers who had become friends, motivated not by formal requirements but by personal connection and the prospects of free dinner. These off-the-books exchanges often drive innovation more effectively than formal processes.

The same patterns emerge in hiring. The best talent flows through networks of people who’ve worked together and want to do so again. It’s why tech hubs exist, and why Silicon Valley became Silicon Valley. The assumption that we can replicate these network effects through purely digital means remains unproven.

Incentives matter too. Remote can push us toward transactional relationships. Managers are left with blunt motivators such as terminations and promotions. Stock compensation can help, although it’s trickier with distributed teams. But the bigger loss is purpose. People don’t push boundaries because of KPIs or quarterly reviews. They do it to prove something to peers they respect, to justify the trust of those who brought them in, to be part of something bigger than a list of deliverables.

Building a successful technology company is extraordinarily challenging. While distributed teams offer undeniable advantages in terms of talent access and operational flexibility, there are limitations. The goal shouldn’t be to assemble a workforce – it should be to build a team, with shared purpose and genuine relationships. This requires careful consideration of how we structure our organizations, balancing the benefits of flexibility against the inherent human needs that drive innovation.

Thoughts on AI (early 2024 edition)

2024-03-30T08:01:02+00:00

This is the transcript of a video series I recorded to share some perspectives on AI for the layperson. I did everything from memory and in one pass, which means I made a couple errors.

How AI has progressed since 2010s

Jensen Huang, the CEO of Nvidia recently said 2012 was the “first contact”. He was referring to AlexNet, the first example of a deep neural net achieving “supremacy” at a task. Supremacy in AI refers to outperforming an average human at a specific task, in the case of AlexNet it meant image classification (e.g.: a particular dog breed). The algorithms used in AlexNet weren’t new, but prior to this time, there wasn’t a practical way to train these models due to the computational requirements.

AlexNet had about 50M parameters. A parameter can be thought of as a knob that can be adjusted to calibrate the model and improve predictions. For example a binary classifier needs fewer parameters than a multi-class one. The more knobs, the more complex the models and their predictions. But bigger models also require more computation. For example, achieving supremacy in some machine translation tasks using deep nets required 500M parameters and it didn’t happen until 2016.

As GPUs have improved, and have evolved from gaming PCs to server racks, models have grown exponentially. GPT-3 has 175B parameters. GPT-4 is thought to have 1.75T parameters. And the next generation of models will continue this exponential trend.

What are GPUs?

Graphic processing units (GPUs) had been in development for over a decade by 2012. By chance, these happen to be highly suitable for model training. It turns out the same type of math required to render complex 3D games is applicable to training neural nets. GPUs are very simple and very parallel– they have thousands of “cores”, unlike CPUs which have dozens of cores. By 2012 GPUs were powerful enough to train a type of model called a convolutional neural net, in a single gaming PC.

The main driver of computational capacity has been Moore’s law. This is the idea that by making transistors progressively smaller, we can fit more of them in the same chip and require comparatively less power. Between 1970 and 2012, transistors per CPU increased by a multiple of 1M, but wattage went up by less than 1000x.

Bleeding edge transistors today are about 24 nm, ( which is confusing because the Fabs that make these chips call the process 3 nm ). While there are a few Fabs that can make transistors this small, such as Samsung. There’s only one manufacturer that can make the most advanced chips needed for GPUs and iPhones. This manufacturer is TSMC and is located in Taiwan. These chips are made using a process called Extreme UV Lithography, and require a machine that only one manufacturer, ASML (in the Netherlands), makes. Each machine can be $100s of M and they are the size of a small bus.

This delicate supply chain leaves open the question of what will happen as the world becomes less globalized.

What are some current bottlenecks for AI improvement?

Computational capabilities are improving fast, but demands for ML have increased even faster. While early models were trained on a gaming PC, more recent ones require many GPUs inside sophisticated data centers. The GPT-3 generation models might have cost $5 - $10M, whereas the next generation of language models (GPT-5) will likely cost 10x that much. If parameters keep growing faster than Moore’s law is improving, which seems to be the case, each following generation will become progressively more expensive.

In addition to compute, larger models require more data. This is another challenge since current state of the art models are already being trained on most of the internet and presumably the majority of digitized books. There is more data out there, for example chats and emails, but it’s lower quality data which might have less impact than a textbook. It seems that for text information, we might be approaching an asymptote.

We are still in the early innings.

We can still extract more knowledge by training models from videos and other media formats, but we still need technical advancements to fully leverage them (e.g. learn physics from video).

Semiconductor companies are building more specialized chips designed to run specific types of neural nets. Likewise, there are opportunities for algorithmic improvements that will reduce the computational complexity of some of these problems.

A lot of work has been done with reinforcement learning, for example by letting a robot learn to play a video game by just playing for many hours and receiving feedback directly from the game. With games, the rules are clear, so it’s relatively easy to learn (although it can take a long time). In the case of general knowledge, it is harder. But it’s possible someone will figure out how to have two agents debate topics endlessly and use external knowledge when necessary to learn from their own debate.

How AI interacts with the economy

Software has been helping automate processes. But AI seems to bring a different quality. While traditional software has provided sophisticated tools for humans to accomplish objectives. AI is becoming capable of fully automating entire tasks. AI can copy-edit an essay in the style of the NYT. AI can generate a logo. Both of these tasks used to require a professional.

It’s interesting to consider how this will affect the economy. For example, progress with automation in manufacturing has resulted in deflationary forces. Broadly, products tend to become cheaper over time as we automate. Even when something appears to cost about the same, like a car or a laptop, the quality of these products has improved, (so on a per-capability basis, they are cheaper).

In contrast, services, which require having humans in the loop, such as healthcare, education, law, have been inflationary. This is also referred to as the Baumol cost disease. Traditionally the boundary between products and services meant technology didn’t have the same degree of impact in cost. But with AI this is bound to change. As a consequence, we should expect to see decreasing costs and improved quality of services.

How AI impacts professions

Like every tech shift, some jobs will be displaced and new ones will be created. After the invention of automatic switches, there was no longer a need for phone operators. Some of these operators were late in their careers and struggled to adapt. But nobody in the subsequent generation pursued that profession, so the market solved the problem.

With this wave of AI, there will be displacement, and it will be difficult for some people. It’s possible the displacement is of a higher magnitude than previous shifts. But the economy will adapt. As to which professions will be most affected, it’s not always clear. When cars become self-driving, it’s very likely most driver jobs will go away. However, we will probably still want human judges, even if models become very good at predicting trial outcomes.

There’s something called Jevon’s Paradox which says that when a natural resource becomes cheaper, consumption actually goes up. This works as long as demand is much higher than supply. For example if horses were cheaper, more people would have them, but not nearly to the point of everyone having a horse. On the other hand, if legal services were much cheaper, it’s possible the demand would be orders of magnitude higher (e.g.: you could draft a contract with your spouse to resolve each disagreement). In this sense it’s possible that heavy automation of many legal tasks will actually increase demand for attorneys, even though they would each be more productive.

How smart is AI compared to humans?

AI has achieved supremacy in specific tasks, meaning that it can reliably beat the average person in translation or image classification. But humans are still computationally more capable and our intelligence is of a more general nature.

The biggest supercomputer today, Frontier, is capable of 10^18 FLOPS - floating point operations per second. Some estimates consider this to be in the range of a human brain. But these supercomputers cost $100s of millions and use 20 megawatts, enough to power a small town. The human brain requires just 20 watts - a few dollars worth of food per day.

Even if we drastically reduce the cost to build and operate these machines, and be competitive with humans. We still haven’t figured out how to leverage this computational capability to build general intelligence. While simply scaling models to be progressively larger has surprised researchers with “emerging” capabilities that were unexpected. Most in the field think that there are still a few discoveries missing before FLOPs can be converted to general intelligence.

That said, things are moving fast and prediction markets estimate that AGI will arrive in the 2030s

How can someone allocate assets towards AI?

The market clearly recognizes the opportunity which is why stocks like Nvidia have grown tremendously. But we are in the very early innings and it is possible the most valuable companies of the AI era have yet to be founded. So a lot is going to be happening in private markets.

There are some private companies building foundational technology, such as large models and new specialized hardware. These companies require a huge amount of upfront investment. The investment can be so large in fact, that even the biggest VCs are hesitant which is why much of the capital is coming from public companies like Nvidia, MSFT, Amazon. And the “capital” is often coming in the form of computation capacity instead of cash.

On the other hand, you have applied companies that are trying to leverage the new technology for existing use-cases. If I was a founder right now, I would be going after this use-case. Virtually every industry will be greatly impacted by AI, so it’s a matter of picking a space and going after it. This is also where our fund ScOp venture capital is focusing on. We believe Vertical AI will be the spiritual successor to Vertical SaaS.

I believe that even as foundational models get better, there will still be a meaningful advantage to packaging them for a particular use-case. So rather than going to ChatGPT and asking it to be your lawyer, there will be an attorney GPT that understands the specific laws that are relevant to your context, has guardrails, and provides guidance on how to use the information.

How do we progress with AI responsibly?

TBD

Word2Vec from Scratch

2024-02-08T08:01:01+00:00

Word2Vec was a pivotal paper published a decade ago by researchers at Google. They showed that by attempting to predict a word from their neighbors (or the neighbors from the word), the resulting model acquired compelling semantic capabilities.

The main element of this model is a n x m matrix, where n is the vocabulary size and m is the dimension of the vector to encode each word. A typical value from m is 300. Rather than having an identifier for each of the roughly 200k words in the English language (in addition to proper names and numbers, which are also words), each word can be represented with a coordinate in 300-dimensions. This is known as an embedding, and is a primordial type of language model (and a component of most language models since).

The implication is that if two words are close to each other in this 300-dimensional space, they are similar. Hence, “dog” and “cat” would presumably be closer to each other than “airplane” is to either of them. This is the so called cosine similarity, enabled by vector databases like Pinecone. Furthermore, each of the 300 axes can be thought as having a meaning (although it might not be monosemantic). So for example “dog” would score high on the axis intended to represent “animalness”, whereas “plane” would score low.

It’s important to understand that this sort of distance calculation wouldn’t be possible if each word was identified with a sequential ID, since that’s equivalent to having a single dimension. Having higher cardinality allows many words to cluster together around central concepts, like “animals”, “computers”, “politics”, and so on.

The shocking result is that this structure is conducive of performing a sort of arithmetic with words. The canonical example being “king” - “man” + “woman” = “queen”. Performing this arithmetic operation on the vectors for the terms in question will yield a new vector within a short proximity of “queen”. Proximity, as opposed to identical, is an important characteristic. The relationship between “Paris” => “France” and “Rome” => “Italy”, will be similar but not identical, given that it was learned from statistical properties of a text corpus. So the inexactness is a feature.

To learn, I implemented word2vec from scratch (following Olga Chernytska ). Run word2vec_py and play with the results using Inference_ipynb

Take a look at this interactive visualization. Try to interpret what is different about the two green clusters.

Note: “king” - “man” + “woman” doesn’t always work as expected. When embeddings are trained, dimensions acquire different meaning, due to randomness. In order to work, the embeddings have to learn the right properties (e.g. male vs female). I suspect a larger training set would improve results. After several runs, I ended up with these n-best results:

king: 0.782
queen: 0.535
woman: 0.516
monarch: 0.515

Personal Goals 2023

2023-11-29T08:01:01+00:00

This document summarizes my goals for the year 2023. The goals themselves are not listed in any particular order, but they are grouped by category. I am doing this for myself. Writing this down forces me to spend a few hours towards the end of the year to reflect on and appreciate everything that I’ve done. As a secondary objective, something here might inspire someone else to follow through on their goals.

Purpose (profession, career, interests)

I don’t believe myself to be predestined to anything, but I want to work on projects that are relevant. I don’t think relevant is strictly the same thing as “good”. In fact, “good” is often an excuse for justifying things that don’t matter. One approach is to be utilitarian: work on things that will have a durable contribution to GDP. I get the most purpose from having goals that help me develop in new ways. If I can see myself evolving across many dimensions and see a path forward, I generally feel purposeful. To put it simply, having things to do, which are also challenging, interesting, and important, gives me purpose.

(score A) Spend at least 100 hours coding/studying ML (aka AI): I built several side projects that made use of LLMs. (1) Tiny Slack GPT is a Slack bot which takes entire conversations as context and can do things like summarizing the viewpoints of different participants in a discussion. It can also assist in a discussion by pulling excerpts from our organization’s documents and database. (2) Promptic is a UI to construct dynamic prompts by selecting options. I made this as an experiment for HeyTutor where we were exploring AI lesson creation. The idea is to select subject, difficulty level, learning style, etc., and feed that to an LLM with zero prompt engineering. (3) I prototyped an LLM-based scraper to extract KPIs (i.e.: ARR, ACV) from notes/transcripts from conversations with founders. (4) I prepared an information-dense presentation that discusses implications of AI for the broader economy and presented it in person at various venues.

(score A) Work with 3 new people I admire: One of the benefits of being an investor at ScOp is getting to work with a portfolio of companies. It also means there are many opportunities to form new professional relationships with people I already know and respect. This year, I was involved in recruiting 4 CTOs: Alex Wilson to Cloverleaf, Phil Gabardo to Lionize, Tumas Rackaitis to Rogo, and David Seigle who I directly hired during my time at HeyTutor. Also at HeyTutor, I had the pleasure to collaborate very closely with Jen Sheffield, now the CEO. Later in the year, I also got to work with my former colleague Heike Schirmer as an executive in residence at ScOp, previously a Director at Amazon Alexa.

(score B) Work more closely with ML companies in portfolio: We have many companies in ScOp’s portfolio doing interesting work in ML/AI, including Yogi, Rogo, Unwrap, Cloverleaf, Customers.ai, and Flip. While I’ve spent a fair amount of time looking at their technology challenges and providing support where possible, I still believe my impact could be higher.

(score A) Write 2 blog posts: I wrote many posts, but LLMs are making writing too easy, so I no longer think this is an important goal. In fact I’ve been thinking about the future of authentic thought leadership. I used to believe that if I had the best ghost writer working for me 24/7, I would be 100x more successful. But now everyone has a great ghost writer. This means that content is no longer trustworthy or valuable. What comes next? So far, I can’t think of many options other than returning to in-person interactions.

(score A) Attend 1 AI conference: NeurIPS

(score F) Complete transaction of Redacted: I thought there was a 50/50 shot at completing a portco transaction in 2023, but it will have to wait. The company is thriving, though, so when the time comes it will be big.

Selection of media I consumed

Movies & Miniseries                 Shows  
--------------------------------    --------------------------------   
The Fablemans                       Tulsa King
Oppenheimer                         Silo 
The Menu                            Barry 
Poker Face                          Fargo (series)
Lessons in Chemistry                The Last of Us
Capernaum 
The Covenant                        Audio & Text
The Killer                          ----------------------------------
Your Honor                          Founders #311 James Cameron
The Whale                           Founders #263 and #264 Edwin Land
Living                              Founders #260 Dee Hock
Daliland                            Invest like the Best, Palmer Luckey
Armageddon Time                     Invest like the Best, Patrick Collison
Air (Nike)                          Invest like the Best, Josh Kushner
All Quiet in the Western Front      Invest like the Best, Bessemer
                                    Invest like the Best, Doug Leone
Misc Video                          How to do Great Work by Paul Graham
--------------------------------    The Brain that Changes Itself
To Be (link #1)                     Chip Wars
Andrej Karpathy's YouTube           A Fermi Paradox Story (link #2)

_{^{link #1: To Be}}
_{^{link #2: A Fermi Paradox Story}}

Connection (partnership, intimacy, family, friends)

I appreciate the meaningful connections that I’ve made throughout my life, including those that were powerful but short lived. Relationships are an important part of life, and I hope my future includes rich experiences with old and new friends. Unfortunately, everyone’s circumstances and interests change, and that means relationships that were once fresh and relevant tend to fall into the background. My observation is that most people allocate significant time and energy to maintain relationships that would otherwise go stale, whereas I’m okay with some relationships dissipating and others flourishing organically. Having a lot of time to myself is extremely important. I enjoy spending most of my weekends diving deep into a new topic, usually tech related, or hiking/running/biking. This creates some tension with maintaining casual relationships, as they usually involve activities in the weekends. Furthermore, casual social activities tend to not start on time and drag on longer than anticipated, which ends up eating into my coveted solo time. I generally prefer social time involving intellectual conversations, shared projects, physical activity, or interesting/new experiences. I believe most people want to separate their personal life (including friends) from work, but for me the ideal friendship is one where there is a high degree of professional intersection. My relationships with family are predicated on the same parameters as other relationships: a history of interactions, common interests, compatible personalities. Fortunately, I was able to find a life partner that shares many of these characteristics. Given that she’s further in the autism spectrum, I’ve been able to reconcile some of these personality traits with my own neurodiversity.

(score A) Greece Trip with Julia: Julia and I had an amazing trip to Greece in the summer. She had taken a classics class and wanted to see and feel the ruins. We rented a car and drove hundreds of miles to visit several ruins. Among other things, I learned that derelict columns are very very important to the Greeks. We walked a ton, ate great food, took a gazillion pictures, and generally strengthened our bond.

(score B) 3 Trips with Friends: I had more than 3 trips with friends, but I also bailed on a backpacking trip in Alaska with Condog, and I had to cancel a mountaineering trip with Dan due to an important work obligation. Even though I don’t regret these decisions, I don’t like being unreliable. It’s one thing to deprioritize casual social events, and an entirely different one to make weak commitments. This is something I need to improve on. Fortunately, I still had the chance to go skiing with many friends in Colo and Mammoth, and spend time in Reno with the Brain Burn crew. I also planned a last minute work trip to NYC with Heike which ended up being great.

(score A) Go to Outside Lands with Julia: This was our second time going to the event together, so now it’s a tradition. Weather was very nice, much less sunny than last year. The most notable moment (a loooong moment) was when we decided to make a front-row attempt during the Foo Fighters performance. We made the call at 2 or 3 pm and fought our way to the absolute best spot, front-center, right by the railings, and watched the performance 8-10 pm. The weather and empty bladders were essential for the success of the mission. Not sure if I would do it again, but it was fun to do it once.

Fitness (strength, endurance, physiology, nutrition)

I feel better when I eat healthy and exercise. Lately, I’ve started to pay attention to my aging and experimenting with methods that can soften the transition, such as being more careful with sun exposure and avoiding products that dry my skin. I’m best at sticking with goals when there are clear KPIs (that move up), so I am an avid user of Strava and do my best to keep a strict fasting schedule.

(score A) Go on 75 runs / 150 miles: 81 runs, 206 miles, 23k ft elevation. I have been less focused on cardio, so I gave myself a relatively easy goal. Most recently, My outdoor running has been negligible. Indoor cardio is just as good, but harder to measure, so I need to find an alternative or pick up my running again.

(score B-) Do Gibraltar 12 times: Gibraltar is a local bike route in Santa Barbara that involves about 4000ft of elevation gain, so this would have been 48k ft. I did about 60 one-third-Gibraltars, totalling 60k ft over 434 miles.

(score B) Pick-up New Workout: I wanted to keep things interesting with a new sport. I considered some edgy candidates such as roller blading but ended up keeping it simple. I started with Yoga in the first half of the year, but then switched to weightlifting, which is sort of cheating because weightlifting was its own goal. But I’ve been very dedicated, spending about 5 hours a week lifting, so there isn’t time for anything else.

(score B) Start Weightlifting: Doing well but started too late in the year.

(score B) Manage Sun Exposure: Mostly doing well except a monstrous sunburn the last day of Burning Man.

(score A) Lower Cholesterol: Started taking simvastatin. Cholesterol dropped and I’m feeling great– no obvious side effects. Guava shows how I finally dropped total cholesterol after 11 years of tracking it.

Supplements I’m taking

Name           Actual Form                          Quantity
-----------    --------------------------------     ----------
Calcium        calcium citrate                      500 mg        
Vitamin D3     cholecalciferol                      125 mcg
Vitamin K2     MK-4                                 100 mcg
Biotin         d-Biotin                             10 mg  
Folate         folic acid                           800 mcg                                
Fish Oil       D3 as cholecalciferol                25 mcg
Fish Oil       omega3                               1180 mg 
Fish Oil       EPA                                  665 mg
Fish Oil       DHA                                  445 mg
Fish Oil       Other                                70 mg
Statin         Simvastatin                          20 mg
NAD+           nicotinamide adeninr dinucleotide    1600 mg

Experience (travel, adventure, surprise, learning)

As I’ve accumulated more experiences, life has become more monotonous. While my past self was concerned (even afraid) of eventual monotony, today I appreciate the opportunity to pursue a continually refined routine. Monotony allows me to eat more healthily, exercise more often, waste less time on things I don’t deem important, and quickly course-correct if I’m disappointed in myself. However, I also believe it is important to be able to look back and quickly recall important/interesting moments. Due to our brain’s ability to constantly improve our pattern recognition, similar experiences are compressed together, becoming hard to recall individually without a relevant queue. On the other hand, novel experiences tend to invade my consciousness even when I’m not seeking them, providing a constant reminder of a life thoroughly lived. To this end, I try to accrue experience outside of my daily routine each year, though with time I’ve realized these infrequent events are also a routine but at a more macro scale (e.g. try to ski each season).

(score B) Ski 14 days: I spent 12 days on the slopes. Other than missing the number of days, it was a great season. I continue to get better at a sport that I encountered only 10 years ago, and which I only get to practice 60 hours a year. The fact that I crashed into a tree and broke a rib or two, doesn’t take away my sense of accomplishment! Last year I was anxious about not getting enough skiing and was considering renting an AirBnB for a month or two, which didn’t happen. Interestingly, this year I don’t have the same craving and have cut down my goal to only 10 days on the slopes. I still hope to do that at some point in the coming years.

(score C) Summer Festival: I enjoy attending an easy summer festival in California every year, usually Lightning in a Bottle, to which I’ve gone several times. These events combine EDM music, with camping, good food, and interactive activities. This year I mistakenly double-booked the same days to go to Greece with Julia, so I gifted the ticket to a colleague. Instead I went for one day to a hyper-local festival in Santa Barbara called Lucidy. I’ve attended Lucidity before, but the weather is not as good (this year was right after an intense rainstorm) and I prefer an environment with fewer people I already know.

(score A) Burning Man: I finally had my glorious return to Black Rock City, with Kevin, Iv, Dario, Chris, Narine, Naz, and some new friends. As usual, this involved a prep weekend in Reno earlier in the season, which was fun by itself. This year was notable in that we had heavy rain during the event and got “stuck”. It looked bad on the cover of the Financial Times of UK, but in reality we were just fine. Burning Man is right at the intersection of doomsday prepping and partying, so we got to have our cake and eat it too. Kevin left early to propose to Aly, so I was left in charge of the Brain Burn and had a blast driving it around the playa.

(score F) Backpacking Trip: This is my biggest disappointment of 2023. Condog delivered a fabulous plan for a backpacking trip, as usual, this time in Alaska. At first, I was very excited, but over the next several weeks I kept getting overwhelmed by the idea. Flights to Alaska suck. Arrivals and departures are in the middle of the night. It was a very rainy season, and I was dreading an infestation of mosquitoes, which Conor exacerbated with a video clip of the inside of his RV being massacred by skitters. There’s fucking bears in Alaska, like real Bears, not the cutesy things you see at Yosemite. I had a trip immediately preceding and immediately after the Alaska trip. And to top it all out, Patricia ended up in the ER for a kidney infection a couple days before I was scheduled to leave. She asked me not to go, and while 90% of the time I would have pushed back, this time I felt relief and cancelled last minute.

(score F) Mountaineering Trip: My good friend Dan Tobin has been trying to get me mountaineering again. This particular trip overlapped with a potential key meeting for a big transaction at work, so I cancelled. Dan is a great guy and I hope we have another adventure together some time. For a while, it seemed like I was going to get into mountaineering. It’s in my personal history, having grown up in the Andes. I’ve done some interesting trips, including Aconcagua, Shasta, Whitney, Dana, Shuksan, and a couple others. But then I slowed down. Maybe risk aversion, maybe I just hate waking up at 2 am for a 12 hour summit push. Not sure. With this sort of thing it’s helpful to have people that are into it. My last father in law, Ray, was awesome at planning trips for Briana and I. My former colleagues Tom and John were good adventure partners, but they are now trying to make it rich and are postponing happiness to another day.

Legacy (children, mentoring, enterprise)

I am fortunate to have my ~~step-~~daughter Julia in my life, in spite of not having biological children. For people outside of my family, I’ve had the highest impact when contributing to an individual’s professional development. This requires finding people that want to have that kind of relationship, and whose needs are aligned with my strengths. I used to think of this exercise as “mentoring”, but in hindsight I’ve also had impact on peers/friends in addition to mentees. I just enjoy talking about career and personal goals and will do it with anyone that will engage.

(score A) Adopt Julia: I thought it would be a nice gesture to make our relationship official, even if there aren’t practical advantages. The process was very easy. We avoided doing the adoption while she was underage, because even if a kid is 17 and they’ve been living with a step-parent for 5 years, the government will send child services and do the whole ordeal before an adoption is authorized. On the other hand, adopting an adult is trivially easy. So easy, that I suspect you could adopt someone without their knowledge.

(score B) Mentor 3 Young People: I often engage with high-potential young people. Either friends of my daughter, through extended relatives/friends, or people that I meet in colleges or high schools when I give talks. I also have a few educators who know I like to mentor and will introduce me someone from time to time. I am happy with the flow of introductions and I think I’m having a positive impact on each of these meetings. But I’d like to see some of the interactions repeat more than once or twice. I’ve had a few people with whom I met semi-regularly, but it was often me driving it, and when I stopped, it died out. I think good mentorship has to be two sided to become a potentially life-long relationship.

(score A) Be Helpful to th Professional Journey of 3 People : I feel great about this. I believe I was helpful to Alex, David, Phil, and Tumas on their CTO journeys. I helped Deven in his transition from investor analyst to operator. I’d like to think I supported Jen in taking over CEO at HeyTutor. And I hope some of the discussions I had with Heike were as helpful to her as they were for me. In all these cases, I’ve learned/gained just as much or more from my interactions with each of these people.

Misc Photos

More skiing

Snow in Santa Barbara

Flooding in Santa Barbara

I’m great with toddlers…

The Sierras

Dave’s Wedding

ScOp’s offsite

More Family Photos

More Burning Man

Schnurr’s Wedding

Ryan’s Wedding

NYC!

Diego visiting from Israel

Infinite Thought Leadership

2023-11-25T08:01:01+00:00

Traditionally the Turing test will attempt to validate whether the machine is as smart as a person. However, consider what makes you think a piece of text was generated by an LLM. More often than not, it probably comes down to the text sounding too clever, well-written, and very consistent in style.

For now, we can tell when something smells like an LLM, and at least for me, it’s off putting. Not because I don’t like engaging with Chat GPT– I love it. But if I assume every piece of thought leadership is an LLM with minimal guidance by the “author”, then why would I read anything you post or, if I do, attribute any credibility to the author?

What is the alternative though? Will 140 characters actually increase in prominence as a format? Will we have to record more videos with some sort of anti-generative verification? (I hope not, I hate recording videos) Will there be a renaissance of in-person events, or at least virtual+live? I want to know what all of you have to say, but I don’t want to deal with everyone becoming a cheap thought leader.

The reality is that each of us probably has very few sufficiently unique and interesting ideas. How do we focus on that?

On Over-Employment

2023-11-13T08:01:01+00:00

I recently discovered an article on overemployment, a term referring to holding multiple full-time jobs secretly. While some, often low-income individuals, juggle multiple part-time roles out of necessity, overemployment typically involves individuals in remote, white-collar, often tech-based, roles. The article portrays these employees positively, as exceptionally productive. I see it differently.

In my managerial experience, I’ve often contemplated the balance between input and output. Fundamentally, output is what counts. Companies value the results their employees produce, prioritizing high performers regardless of their commitment level. Culturally, focusing on results rather than time spent is preferable, creating a less intrusive work environment.

Yet, those experienced in managing diverse teams know this view is overly simplistic. For non-repetitive, creative tasks, results aren’t always measurable weekly. Complex tasks require iteration, with results appearing intermittently. While end results are crucial, they are preceded by significant effort and dedication – the input. This might sound complex, but it’s logical. Consider the Manhattan Project; its success wouldn’t have been possible if team members were splitting their focus with other jobs.

A leader’s role is to foster a productive work environment, involving hiring the right people, promoting a positive culture, setting unified team objectives, and more. Ironically, an excessive focus on quantifying results, while avoiding input metrics, can lead to an uncreative and stressful workplace. In such settings, employees become risk-averse, engage in unnecessary debates about task sizing and assignment, and innovation suffers.

Conversely, in an organization filled with dedicated individuals, input metrics become irrelevant. Everyone is focused on achieving challenging goals, knowing not all efforts will be successful. In such cultures, innovation often occurs outside traditional work hours and settings. A committed environment like this doesn’t obsess over employees’ comings and goings but requires daily dedication to maintain.

When teams view their work as mere transactions, the culture deteriorates. Leaders must promote behaviors that foster a non-transactional environment. Employees with multiple full-time jobs inherently challenge this culture and should be discouraged.

Portfolio Simulator

2023-10-21T08:01:01+00:00

This document is intended to help those interested in venture capital think about the asset class and potential for returns. Unless specified, all results are net to LPs, so carry of 20% is already discounted where appropriate.

As an exercise, I built a simulation that attempts to recreate historical performance tracked by Cambridge Associates. I took data from this 2020 report (page 14), and filtered it for the 10 year period between 2004 and 2013. I chose that period because the vintages surrounding the internet bubble show unusually high volatility, and discarded years closer to 2020 given that it takes about 7 years for the TVPI to stabilize.

Cambridge Associates Data

import tabulate

data = [
        [2004,1.69,1.72,1.20,1.82,0.82,63],
        [2005,1.68,1.66,1.41,2.07,0.91,61],
        [2006,1.69,1.56,1.59,1.95,0.75,78],
        [2007,2.29,2.38,1.76,2.93,1.31,68],
        [2008,1.77,1.71,1.40,2.21,1.09,64],
        [2009,2.10,2.12,1.75,2.46,1.30,23],
        [2010,3.21,2.80,2.12,3.41,1.41,48],
        [2011,2.48,2.26,1.90,2.71,1.39,44],
        [2012,2.21,2.48,1.73,2.37,1.35,55],
        [2013,2.02,2.01,1.81,2.26,1.34,59],
        ['Average',2.11, 2.07, 1.67, 2.42, 1.17, 56]
]

headers = ['Vintage','Pooled','Mean','Median','Upper Q','Lower Q','Funds']
table = tabulate.tabulate(data, headers=headers)
print(table)

Vintage      Pooled    Mean    Median    Upper Q    Lower Q    Funds
---------  --------  ------  --------  ---------  ---------  -------
         1.69    1.72      1.2        1.82       0.82       63
         1.68    1.66      1.41       2.07       0.91       61
         1.69    1.56      1.59       1.95       0.75       78
         2.29    2.38      1.76       2.93       1.31       68
         1.77    1.71      1.4        2.21       1.09       64
         2.1     2.12      1.75       2.46       1.3        23
         3.21    2.8       2.12       3.41       1.41       48
         2.48    2.26      1.9        2.71       1.39       44
         2.21    2.48      1.73       2.37       1.35       55
         2.02    2.01      1.81       2.26       1.34       59
Average        2.11    2.07      1.67       2.42       1.17       56

Simulation Rationale

The goal for our simulation is to output a distribution that matches the average row in the table above. There are two sets of parameters I will be adjusting in order to adjust the distribution. First, I will divide the set of possible outcomes into “failed”, “breakeven”, “triples”, “homerun”, and “unicorn”, and assign each a probability. Then, I will adjust the range of expected proceeds for each category.

After I feel good with the results, I will do a sanity check against other sources, such as 500 startups and Angel List

There are a few other important parameters:

Number of Investments: 24. As of right now, our first fund has 21 investments. Our next fund will be larger, and it seems possible to support 2 investments per partner per year. With 4 partners investing actively over 3 years, we get 24 investments.
Number of Funds: 4000. According to crunchbase, there are about 4000 active investors with more than 20 investments. This is important to get a sense of what the tails look like.
Fund size, fees, duration, follow-on capital: standard values based on our current fund. Note that follow-on capital is never deployed using this simple model.

Simulation

# 1. Import necessary libraries

import numpy as np
import pandas as pd
import tabulate
import matplotlib.pyplot as plt
import matplotlib.ticker as ticker

# 2. Define the investment parameters

np.random.seed(42) # for reproducibility
fund_size = 50e6
fees = 0.12
carry = 0.2
portcos = 24
duration_years = 7
investable_capital = fund_size * (1 - fees)
follow_on_percent = 0.20
primary_investments = investable_capital * (1 - follow_on_percent)
check_size = primary_investments / portcos
number_vcs = 4_000 # approximate number of VCs with more than 20 investments in the USA, according to crunchbase

# 3. Define the probabilities and returns for each outcome


probabilities = [0.59,  # "failed", 
                 0.15,  # "breakeven", 
                 0.15,  # "triples",
                 0.10,  # "homerun",
                 0.01]  #  "unicorn"

# 4. Simulate the returns for the 24 companies 4000 times
df = pd.DataFrame(columns=["multiples", "positions", "total"])

def get_net(x, carry=carry, fund_size=fund_size):
    """ 
    returns x minus carry when appropriate 
    works with numbers or np arrays
    """
    return (np.maximum(x-fund_size, 0) * (1-carry) + np.minimum(x, fund_size))

for _ in range(number_vcs):
    # Define a list of functions for each distribution
    distributions = [
        lambda: 0,  # failed investments return $0
        lambda: np.random.uniform(1, 1.25),  # breakeven return 1x - 1.25x
        lambda: np.random.uniform(2, 4),  # "triples" return 3x - 8x
        lambda: np.random.uniform(12, 20),  # homeruns return 6x - 20x
        lambda: np.random.uniform(50, 175)  # unicorns return 50x - 175x
    ]

    # Sample indices based on probabilities
    indices = np.random.choice(len(distributions), size=24, p=probabilities, replace=True)

    # Call the chosen function for each index to get the actual value
    multiples = np.array([distributions[i]() for i in indices])
    

    positions = multiples * check_size
    total = positions.sum()

    # Append the values to the DataFrame
    new_row = pd.DataFrame({
        "multiples": [multiples],
        "positions": [positions],
        "net_positions": [get_net(positions)],
        "total": [total],
        "net_total": [get_net(total)]
    })

    df = pd.concat([df, new_row], ignore_index=True)
    
df = df.sort_values(by="total")

Results

# 5. Calculate quartiles for the 100 simulations

def get_percentile(df, percentile):
    """ returns percentile from a sorted dataframe """
    return df.iloc[int(df.shape[0]*percentile)]

def get_irr(tvpi, years=duration_years):
    """ returns percentile from an ordered list """
    return (tvpi**(1/years)-1)*100

statistics = {f"{pcnt*100:,.0f}th": get_percentile(df,pcnt)['net_total']
               for pcnt in [0.25,0.50,0.75,0.90,0.95,0.99]}
statistics['Mean'] = df["net_total"].mean()
statistics['Top'] = df["net_total"].max()

# 6. Display the results

data = []
headers = ['Percentile', 'Net Proceeds', 'Net TVPI', f'IRR ({duration_years} yrs)']
for q, net_total in statistics.items():
    tvpi = net_total/fund_size
    irr = get_irr(tvpi)
    data.append([q, f"${net_total / 1_000_000:.0f} M", f"{tvpi:,.2f} x", f"{irr:.2f} %"])

# .INX	01/01/2004 $1,131.13	12/31/2013 $1,848.36
data.append(["S&P500", "N/A", "1.63 x", f"{get_irr(1.63):,.2f} %"])

table = tabulate.tabulate(data, headers=headers)
print(table)

Percentile    Net Proceeds    Net TVPI    IRR (7 yrs)
------------  --------------  ----------  -------------
25th          $58 M           1.16 x      2.08 %
50th          $80 M           1.61 x      6.99 %
75th          $120 M          2.41 x      13.37 %
90th          $219 M          4.39 x      23.52 %
95th          $257 M          5.14 x      26.35 %
99th          $353 M          7.06 x      32.21 %
Mean          $104 M          2.07 x      10.99 %
Top           $625 M          12.50 x     43.45 %
S&P500        N/A             1.63 x      7.23 %

Visualization

# 7. Visualize the results

plt.figure(figsize=(14, 7))
plt.hist(df['net_total'].tolist(), bins=50, color='blue', alpha=0.7)
for q, val in statistics.items():
    plt.axvline(val, color='red', linestyle='dashed', label=f"{q}: ${val/1e6:,.0f}M")
plt.title(f"Distribution of Returns over {number_vcs} Runs")
plt.xlabel("Net Proceeds")
plt.ylabel("Number of Runs")
plt.legend()

# Custom formatter for x-axis
formatter = ticker.FuncFormatter(lambda x, pos: f"${x/1e6:,.0f}M")
plt.gca().xaxis.set_major_formatter(formatter)

plt.grid(True, which="both", linestyle="--", linewidth=0.5)
plt.show()

Conclusion

The simulation shows a distribution consistent with the initial Cambridge Associates data, and also consistent with anecdotal data reported by experienced venture capitalists. For this particular period, the median fund (50th percentile) would have performed simmilarly to the S&P 500 index, whereas a fund at the top quartile would have almost doubled the IRR of the index. Because of the exponential nature of startup returns, funds at the 90th and 95th percentile perform extraordinarily well. Of course, these are different asset classes, with varying characteristics of risk, liquidity, and tax exposure (bless the QSBS).

From the chart above, we can see the distribution doesn’t decay smoothly, with a sudden drop at around $250M, which highlights some problems with this approach.

If you have any feedback or suggestions, please reach out.

Feel free to modify this Jupyter Notebook

On Dev Shops

2023-06-14T08:01:01+00:00

Companies often turn to software development consulting shops for a helping hand to build or enhance their products. At first glance, this appears to be a rational decision, but hidden beneath short-term benefits, there are darker repercussions with long-lasting implications that can significantly impact both the business and team dynamics.

Consider, for instance, the process of removing a consultant after they have accumulated institutional knowledge. The length of time they have worked for the company makes termination exponentially more difficult. As a result, consulting partnerships can rapidly settle into an awkward, entrenching pattern, and the time it takes to safely end the relationship could drag on for months.

The irony is that as more consultants are added, the consulting relationship expands rather than contracts. In the short term, companies are coaxed into fostering these relationships as they scratch the itch of necessity. However, in the long run, such decisions hinder their ability to evaluate and potentially end a consulting arrangement.

Over-confidence is yet another danger lurking beneath the surface of consultant relationships. Often, consultants are overly persuasive in convincing companies about their capacity to build a particular product or feature. They excel at selling themselves and their technical prowess, but this can lead to unrealistic expectations and inevitable disappointments.

Imagine the confusion a newly hired CTO would experience working alongside an existing consulting shop. Unclear leadership and lines of decision-making authority can prevent the CTO from gaining full control of the company’s technical direction. In such a situation, it is crucial to redefine reporting lines to concentrate decision making under the CTO.

The growing knowledge of your business and its problem spaces within the consulting shop turns into a double-edged sword. On one hand, it seems beneficial to have consultants with expertise in your industry, but on the other hand, it gives the consulting shop substantial leverage in negotiating with your company. The bargaining power they wield in comparison to individual employees can create an unstable environment from a financial and operational standpoint.

Often juggling an array of projects and clients, consultants are unlikely to invest the same commitment and passion into your company’s work as full-time employees do. Consequently, when you unmask the real extent of their commitment and effort, disappointment is almost inevitable.

Lastly, collaboration with an external dev shop inadvertently encourages project management methods that stifle productivity and breed confusion. The interface between the in-house and external teams becomes clouded, leading to tension and miscommunication. The uncertainty originating from this tension may jeopardize the overall project’s success and team morale.

The Hard Work Dilemma

2023-05-07T08:01:01+00:00

During a recent conversation with a colleague, we discussed the idea of work ethics and the impact they have on personal and professional development. This is a subject that has arisen frequently throughout my career in management, and it’s one that I find both intriguing and challenging.

I’ve noticed that individuals tend to fall on a wide spectrum when it comes to work ethics. Some are highly committed to their work, while others are more nonchalant about it. As a manager, it can be difficult to guide those who seem to be caught in the middle, struggling with a kind of cognitive dissonance. They may feel that society places too much emphasis on hard work and dedication, yet they also want the rewards that come with a strong commitment to their careers.

I remember a particular conversation, at Amazon, in which someone expressed that they didn’t prioritize work as much as the average employee. My response was that everyone has different priorities, and that’s fine. However, their counter argument was that this creates an unfair advantage for those who choose to invest more time and energy in their work. When I half-jokingly suggested that we enforce a strict 5 pm cutoff time for all employees to level the playing field, they actually agreed, which would be an oppressive restriction for others.

The root of this issue, I believe, lies in the tendency for people to view work in a purely transactional manner. It’s true that a job is fundamentally a transaction, with employees providing services in exchange for compensation. However, this narrow perspective overlooks the broader concept of work - the tasks and efforts we engage in outside of our professional lives, often without any monetary reward.

People might spend countless hours planning a wedding, helping a friend in need, or taking care of their children - all forms of work that don’t come with a paycheck. And yet, when it comes to their jobs, these same individuals may display a lack of enthusiasm or commitment to the quality of their work. This is somewhat paradoxical, as one might expect that the addition of financial incentives would lead to even greater dedication.

Of course, not all jobs are enjoyable, and there are undoubtedly tasks that we’d rather avoid. Even so, we willingly undertake unpaid work like changing diapers or taking out the trash. Moreover, many of the people I’ve managed have enjoyed working conditions that are far better than those experienced by the vast majority of the global population. It seems that there’s a psychological resistance to performing tasks out of necessity or obligation, as opposed to genuinely wanting to do them.

I find it curious that many individuals strive for competence and efficiency when assisting a friend or handling a family crisis, yet choose to be lackluster or even incompetent in their professional lives. This isn’t about working longer hours or putting oneself at a disadvantage; it’s about the decision to expend the same amount of energy on doing a task poorly as one would on doing it well.

My suggestion is to adopt a dual approach to one’s job. Most of the time, focus on doing good work, regardless of pay, management, or company mission. Strive to excel at the tasks at hand and learn as much as possible in the process. Periodically, perhaps twice a year, step back and evaluate the overall satisfaction with your job, including compensation, relationships with coworkers, and other factors. If this reflection reveals a desire for change, pursue it. If not, continue to do good work.

While I can only speak from my own experience, I believe that, all else being equal, a commitment to doing great work will lead to greater happiness and success in the long run for most employees.