London 2024 – Thursday

I’m pretty sure that this was the first day that I was feeling less jet lagged. I was so happy with my hotel choice. The Clermont at Charring Cross was located so perfectly that I was able to access everything that I wanted. I think the furthest that I went was the Tate Modern but even that was only 25 minutes or so away.

I visited the Roman Mithraeum which I loved. I walked past St. Paul’s which is quite an architectural specimen, as were many of the buildings in the area. I was on my way to an archaeological exhibit on the site of a roman temple from 2 millennium ago. The cult of Mithras was imported to Rome and the Zoroastrian heritage is still present. From there I crossed the Millennium Bridge where my seat mate from the ballet the night before had mentioned that I could find some “micro art” on the foot bridge. Sure enough, there were hundreds! I walked the southern shore line and was hoping for a good photo with the reflections in the water.

I was going to a presentation at the University of London in the evening, it was a book launch for a new title Legacies of Migration and a few of the chapter authors were there to discuss their topics. One of them was about Van Gogh and his year in London in his early 20s. The main thesis is, as you can imagine, that migration is a constant in London and the city benefits from its multicultural past and present. I spent the afternoon at the British Museum which is beside the building with the lecture hall.

London 2024 – Wednesday

Did I mention that I was tired? London is just such an exciting city. There is so much to see and explore! I went in search of the Noses of SoHo. I ended up finding 4… or 3 and a nail before I grabbed lunch and some sweets. Then the National Gallery, for a few hours. The National Gallery collection is like visiting an art history textbook. Room after room of amazing art. Massacio, Rembrandt, della Francesca, and Titian around every corner.

Manon was exquisite! I’m certainly no expert on ballet but I loved it. It was a very moving performance and the entire experience at the Royal Opera House was such a treat.

London 2024 – Tuesday

I was tired by day 2. Excited and tired. Im happy that breakfast was great to start the day. I probably had too much coffee but I had a big day ahead of me! The genesis of my trip was the Philip Guston exhibition at the Tate Modern. It was pretty amazing entering the first room and seeing his paintings in real life. I had only seen the one in the archives downstairs at our own Nation Gallery and its presence is felt with the texture of the brushstrokes and the size of the works. These are his murals and I wandered back and forth, room to room just awed by seeing this collection. I stopped at the Courtald for the Frank Auerbach show of his charcoals. Auerbach’s work that I also saw a few years back at the Tate Britain show is architectural in the way that he applies form, perhaps sculptural is a better word? The Courtald also has an amazing collection of the Impressionists and I lingered in front of the Cezanne for too long perhaps. And after a phenomenal dinner, (I’ll make a separate post just for food photos) I wandered the Parliament district is search of some black and white photo opportunities. My camera loved London.

London 2024 Day by Day – Monday

I arrived early in the morning and was grateful for an early check-in at the hotel. I took a friends advice and went to the Churchill Museum and wandered the sunny streets of London. Tosca was fantastic! I saw it in Ottawa with a friend years ago and was looking forward to this production.

London 2024 – some iPhone images

Wow! What a trip. It was only 5 days but I tried to pack a lot into it. I have imported all of my camera photos but always start with my mobile images since they are fewer. The flight, hotel, food, culture and entertainment were fantastic. And the Royal Opera House is a beautiful venue! Tosca, Manon and La Boheme were so beautiful to experience in a live setting.

The Evolving Landscape of Language Models: Exploring Reasoning, Learning, and Future Horizons

Rumination on Q* and what it could potential imply.

Q-learning and STaR are, I think, what OpenAI is talking about when it references Q*.

Language models’ capacity for nuanced reasoning has been a focal point of research. Enter the Self-Taught Reasoner (STaR), a groundbreaking technique that augments language models by integrating sparse rationale examples with vast datasets. This innovative approach fosters an iterative learning process, refining models to generate coherent chains of thought for diverse problem-solving tasks.

See STaR: Self-Taught Reasoner Bootstrapping Reasoning With Reasoning for more details.

The essence of STaR lies in its ability to fine-tune models based on the correctness of generated rationale. This iterative refinement loop catapults language models to not only achieve significant performance improvements but also to rival larger, more resource-intensive models on complex tasks like CommensenseQA. Does this mean that the model has surpassed the human results? From 56% on the original trials to equal 89%, the human performance, or more?

STaR’s success embodies a pivotal shift—a leap forward in language models’ autonomous reasoning. It sets a precedent for future advancements in bridging the gap between artificial intelligence and human-like cognition, redefining the boundaries of what these models can achieve.

Beyond STaR’s iterative prowess, insights gleaned from Q-learning and Markov chains provide critical guidance for scaling language models’ performance. Studies leveraging these concepts reveal a foreseeable decline in model performance as problem complexities increase.

Q-learning is a fundamental concept in reinforcement learning, a type of machine learning. It involves an algorithm that enables an agent to make decisions in an environment to achieve a specific goal. Through trial and error, Q-learning helps the agent learn the best action to take in a given state to maximize its cumulative reward. It does this by updating a Q-table, which stores the expected future rewards for each action in every possible state. Over time, the agent refines its actions based on the values in this table, gradually optimizing its decision-making process in complex environments without prior knowledge of the environment’s dynamics.

An aside – the implications of these insights underscore the necessity of strategically balancing computational resources during both training and testing phases. This balancing act becomes imperative for ensuring sustained model performance across a spectrum of intricate problem landscapes. The parallel nature of these once linear processes is where my interests lie*.

* For those asking for clarification, this has to do with Douglas Hofstadter’s work Gödel, Escher, Bach that discusses a cybernetic hierarchy comprised of a hierarchical “stack” of instructions that carry out functions. For Hofstadter a program that rewrites itself violates this hierarchy.

Consider a scenario where language models seamlessly engage in real-time problem-solving during emergencies, prioritizing resource allocation akin to a human decision-making process. These insights lay the groundwork for future innovations, enabling language models to navigate diverse problem spaces with enhanced adaptability and efficacy. But how future? What defines the constantly shifting reward modeling? How does it allocate rewarding?

Language models, once confined to simple word predictions and text generation, have undergone a paradigm shift. They now navigate intricate reasoning tasks, delve into problem-solving domains, and strive towards human-like cognitive capabilities.

The journey towards refining reasoning capabilities extends into the domain of mathematical problem-solving—a seemingly straightforward yet challenging realm for language models. The GSM8K dataset encapsulates this complexity, revealing the struggle even formidable transformer models face in navigating grade school math problems.

To overcome this hurdle, researchers advocate for training verifiers to scrutinize model-generated solutions. The success of these verification mechanisms showcases their potency in augmenting model performance, especially in handling diverse problem distributions. This essentially not only increases the frequency but also the total distribution of rewards in any space, a clustering of rewards. Makes sense, this mirrors real world learning.

In the pursuit of refining reasoning capabilities, the exploration of supervision techniques emerges as a pivotal aspect. A comprehensive investigation into outcome and process supervision reveals the latter’s superiority in training models for intricate problem domains. Checking each step of a process, enabling reward reinforces accuracy rates.

Process supervision, with its meticulous feedback mechanism for intermediate reasoning steps, exhibits unparalleled reliability and precision. When coupled with active learning methodologies, exemplified by the release of PRM800K, this supervision approach propels related research endeavors, promising a robust foundation for future advancements.

Consider a scenario where these models assist in personalized education, adapting to individual learning styles, or co-create narratives alongside authors, blurring the lines between artificial and human creativity. The potential for language models to revolutionize domains extends far beyond what we envision today.

Imagine language models not just deciphering language but engaging in philosophical discussions about complex moral dilemmas or even participating in real-time collaborative problem-solving scenarios during crises. And I think that a lot of the discussion about the “Crossing of the Rubicon” in the miasma of the last week at OpenAI revolves around the fact that now capable, the ethical “wrapper” is a shadow but imperative. Their ability to actively engage in profound ethical debates remains a nascent area.

Envision language models not just decoding textual content but understanding the depth and nuances of moral quandaries. Imagine a scenario where a language model is posed with a complex moral dilemma, such as the classic “trolley problem,” where decisions involve choosing between utilitarian principles and individual rights. The model, armed with extensive knowledge of ethical theories and moral reasoning, would not only parse the scenario but engage in a dialogue, weighing the pros and cons of different ethical frameworks and articulating its stance on the matter.

For instance, such a model could explore various ethical perspectives—utilitarianism, deontology, virtue ethics, or ethical relativism—articulating arguments, counterarguments, and the implications of each stance. It could draw from historical ethical debates, ethical principles, and even contemporary ethical dilemmas to contextualize its responses.

The implications of this extend far beyond theoretical discourse. Language models proficient in ethical reasoning could aid in decision-making processes across diverse fields. They could assist in ethical assessments in various industries, offer guidance in moral reasoning to individuals facing ethical quandaries, or serve as a tool for educators to facilitate discussions on ethics and morality.

However, such advancements raise profound questions and challenges. Ethical reasoning is inherently complex and often involves subjective considerations, societal norms, cultural context, and emotional intelligence—factors that are intricate for machines to grasp fully. The ethical development of such models would necessitate a deep understanding of not just logic but empathy, context, and the ability to comprehend the subjective nature of human ethical reasoning.

Moreover, the ethical implications of deploying such models into real-world decision-making contexts warrant careful consideration. How would we ensure the models’ reasoning aligns with societal values? How do we mitigate biases or unintended consequences in their ethical assessments?

Future innovations might unveil models that not only traverse language intricacies but also navigate philosophical landscapes, challenging societal norms, and catalyzing groundbreaking innovations across diverse domains. These reflections offer a glimpse into a future where language models not only emulate human-like reasoning but also shape the realms they interact with.

The landscape of language models has traversed a remarkable journey—from simple text generation to sophisticated reasoning and problem-solving. The advent of methodologies like STaR, insights from Q-learning and Markov chains, and the exploration of supervision techniques have thrust these models into realms once deemed unattainable.

As these advancements continue, the horizon of possibilities expands, offering a glimpse into a future where language models not only comprehend language intricacies but also engage in profound philosophical discourse, challenge societal norms, and catalyze innovative breakthroughs. The journey of language models is an ongoing exploration, promising exciting possibilities and transformative impact across various domains.

Naples, Palermo and Rome with my Holga pinhole 50mm

I decided to rummage through my photos for the ones that I took with my Holga lens. The grit of Naples and Palermo, let alone the Pantheon on a dark and rainy night in Rome, made for some nice black and white pictures.

Some first images from Rome.

And some art by Caravaggio and Michelangelo.

Canada and Immigrants from Europe post 1945

The Canadian government was recently vilified for the appearance and recognition in Parliament of an immigrant from Ukraine who served in the Galicia Division that was formed in 1943, made up of Ukrainian volunteers to fight against the Soviet Union. I decided to look into the text of the 1985-1986 DeschĂȘnes Commission that was digitized in 2012 by Privy Council Office to educate myself on this issue. This Commission was established specifically to report on the issue of war criminals in Canada and included discussion of this specific movement. Text is provided for Part I that was designed to be published with Part II, destined to remain confidential although there are calls to have this released. The number of war criminals in Canada ranged from a “handful” to 6000 at the time of the Commission. It should be noted that outside interveners in this Commission never quoted a figure under 1000.

The Report shows the geo-political role played by Canada at the end of the War in Europe, suffering immensely due to the economic collapse of decades of hostilities and war. All Europe was now moving, in one direction or another, and Canada’s economic goals aligned with geo-political goals of managing the recovery of Europe through the acceptance of large numbers of persons displaced by war, even if many of these policies originated in Washington or London. The Report also shows the limitations of Canadian authority as it emerged from its subordinate Dominion status. The accoutrements of a mature state were not yet in place.

As a (barely) former colony of the British Crown, Canada only enacted an independent Immigration Act in 1952 and decisions made by Cabinet in this instance must be considered using this lens. Independent foreign policy was less than two decades old at the close of hostilities. Immigration was subsumed under Order in Council P.C. 1931-695, a tightly restrictive policy. The Citizenship Act of 1947 was a step away from Canada’s status as dependent but a “Canadian” immigration act was still years away. Immigration was controlled from the centre, as it was in 1919, policies put in place after the Great War.

I’ll also add that since Canada was not a signatory to the Charter or the 1945 London Agreement, it didn’t have any jurisdiction over this type of offence. Canada still did not have fully developed State authority over elements that would have been needed here. This was only to come over the next decade. As the report makes clear, there was no attempt to hide the fact that these persons were members of the German military that Canada was at war with so no basis to claim that these people hid their histories and overturn their status in Canada.

Canada began to play a key role in the emergent Cold War of the late 1940s as a “release valve” for emigrants from across Europe, including those who lived in former Nazi and fascist controlled areas across Europe. Immigrants from all over the continent wanted to immigrate out of Europe, many of them living years directly or indirectly under fascist rule, suffering oppression and genocide. The initial years of the post-war were dedicated to returning Canadian service personnel from across the globe. Immigration as we know it was non-existent and “who” could immigrate was moot. As order returned, so too did the desire to manage this migration for state goals. Demand was coming both from our allies in Europe and the US, along with Canadians who wanted their family members to join them here, not to mention economic growth. Economic growth as a key policy priority was evident in the 1947 agreement with the UN Relief and Rehabilitation Agency and the International Refugee Organization to bring people from Europe as contract workers in specific labour markets.

In June 1949 all immigration was restricted by Order in Council 2743 to those with relatives in Canada, citizens of the UK, USA, and France and agriculturalists, miners, lumberers, loggers and domestic workers. These occupations did not, of course, have Occupational Codes since no system such as the National Occupational Classification or any occupational codes were in place. This data was not to be collected for decades. But I digress. Important for us here is that the Canadian government had denied the entrance of Ukrainians currently held in the UK including the Galicia / Halychyna Division.

This Ukrainian division had surrendered to British forces in May 1945 and were held in Rimini until 1946 when they were screened and transferred to the UK. After the repeal of P.C. 2743 in 1950 by P.C. 2856 there was a change in policy to accept, among others, this group that was currently held in the UK noting that they still required screening both for the immigrant and the applicant in Canada, if applicable. The Canadian Jewish Congress objected and Ottawa paused.

The British Foreign Office stated that both Soviet and British missions had no evidence that these persons fought against Western Allies or engaged in crimes against humanity. And while the CJC continued its objection, Canada opened the doors to Ukrainian immigration from the UK. According to the DeschĂȘnes Commission, approximately 600 of these former members of the Galicia Division were in Canada in the mid 1980s. Simon Wiesenthal gave a list of 217 specific names to then Solicitor General Robert Kaplan. This list was investigated and of those, over 86% never set foot in Canada and the few that did had no specific accusations leveled against them. Investigations by both the RCMP and the Commission reached the same conclusion.

I invite you to read the report. It is an interesting story of immigration to Canada that reaches into our own day with the resignation of the Speaker of the House last month.