MyOwnPirateRadio

I swear, I didn’t work on Windows 7

November 5, 2009 · Leave a Comment

In the silly-marketing-antics department, my buddy Scott forwarded this screenshot around yesterday. It’s from  Steve Ballmer’s Windows 7 launch keynote slides.

Win7_Launch

In that main triangle of people in the center, the back three rows or so is lifted directly from a photo of the Live Search team. I’m in it — back row, 3rd from the left — so it must have been taken in late 2004 or early 2005.

Scott’s in it too, from many moons ago when he worked at Microsoft. In fact he’s in there at least twice.

Can’t wait for my ship-it award!

→ Leave a CommentCategories: Uncategorized
Tagged: , ,

Join the Toronto Open Data Community

October 11, 2009 · 1 Comment

Toronto, we have lift-off!

The City is hosting an “Open Data Lab” on Nov 2 to kick off their community engagement on open data.

The Open Data Lab is an opportunity to explore the innovation possibilities of open civic data in Toronto. Join City subject matter and technology experts, community stakeholders and talented members of Toronto’s vibrant technology and design communities in an interactive and collaborative afternoon imagining commercial, social and civic applications of the City’s newly launched open data program.

Let’s get started.

→ 1 CommentCategories: Uncategorized

Mint CEO on Startup Building

October 11, 2009 · 1 Comment

Mint.com CEO Aaron Patzer recently did a great presentation called “Everything you wanted to know about startup building, but were afraid to ask“. In it he chronicles the development of Mint.com from the germ of an idea all the way to exit. (Mint sold to Intuit for $170M.)

It’s not just an interesting story. He talks in fine detail about costs, salaries he paid himself and employees, equity, business model… all the things you’d need to know if you wanted to build your own company. This sort of detail is usually kept hush-hush.

The accompanying slides are here.

Every startup founder should watch this.

→ 1 CommentCategories: Uncategorized

New York City is doing Open Data too

October 8, 2009 · Leave a Comment

With an apps competition, no less.

The New York Times says:

Contestants will have access to more than 170 data sets supplied by over 30 city agencies, including weekly traffic updates, schedules of citywide events, property sales, restaurant inspections and mappable data around school and voting districts.

Exciting!

We ought to do something of similar spirit in Toronto, once people have had a chance to dogfood a version 1 “sandbox” release of open data.

See also: Open Data: What’s on Your Wish List?, Open Data: Making Toronto a Better Place to Live

→ Leave a CommentCategories: Uncategorized

5 Blocks Out has a Blog

September 29, 2009 · Leave a Comment

Katrin and I have been working for a while now on a project called “5 Blocks Out“. We’ve just launched a blog for it, which you can find at blog.5blocksout.com.

The 5 Blocks Out Blog will mainly chronicle the development of the site.We’ll also post some city-related musings from time to time, similar in spirit to the posts Katrin has been writing on the Mukodu Blog.

Currently 5BlocksOut.com is in the early stages of development – but already a great deal of fun. So whether you are interested in hearing about early stage start-ups, adventures in the City or how to get the most out of the site, the blog should have something for you.

Send me a line if you would like to learn more. Or check out the new blog as it gains momentum.

→ Leave a CommentCategories: Uncategorized

Open Data: What’s on Your Wish List?

September 16, 2009 · 4 Comments

Fall is here, and I’m eagerly awaiting the sandbox and first release of City of Toronto “open data”. Since I work every day with data on 5 Blocks Out I’ve been thinking about what I would like to see opened up. In that spirit, here’s a short personal wish list.
First, some general “tenet” wishes:
- Publish all data in machine-readable open standard formats instead of just PDFs and unstructured text. JSON, XML, CSV, iCal, etc.
- Publish standard format street addresses, at least, for all location-based data. Better yet is a latitude/longitude pair.
- Publish time-based information such as events in a calendar format such as iCal.
- Any refreshable dataset needs unique durable IDs for every object in the data so readers can detect changes over time.
- Document the data at least enough for people to understand and use it productively.
Second, a few specific data sets that would be very useful:
Lists of places, including place name and location information. A foundational part of what we’re doing on 5 Blocks Out involves situating thousands of places on maps. We’re interested in all kinds of places… businesses, private organizations, and government facilities such as parks, community centres, and libraries. To do this we need trustworthy data sources with at least a name and location for each place.  Ideally the location is already stated as a latitude/longitude, but street addresses also suffice as they can be geocoded into latitude/longitude using various free geocoding web services. << geocoder, google >>
In addition to a place name and location we try to describe each place. For example, if it’s a business or a private organization, what sort of products and services does it provide? If it’s a government facility, what services does it offer to the public? How might one contact this place via phone, email, or fax? Is there a website URL available? And so on. This information is useful, but not essential.
Lastly, we look for unique identifiers, so that we can tell places apart and identify changes in place information over time. For instance, if a business moves from 123 Main Street to 245 Main Street, how do we know it’s the same business? Some data sources publish unique ID information that enable us to detect this sort of change. It turns out that  without unique IDs to rely on you need to come up with funky duplicate detection heuristics.
Here are examples of name-and-location information data sources the City of Toronto publishes today that we would love to have in easily machine-readable format:
- DineSafe lists restaurant names & addresses along with inspection data http://app.toronto.ca/food2/DineSafeMain
- Community centres: http://www.toronto.ca/parks/recreation_facilities/community-centres/index.htm
- Parks: http://www.toronto.ca/parks/parks_gardens.htm
Event data. Like many websites, we’re interested in publishing calendars of events happening throughout the city. We define “event” broadly to mean anything interesting that is time-based. An event might be a major street festival, a running race, a sporting event, a concert, or a city councillor doing a public consultation meeting. Jon Udell has done some great work on this with his ElmCity project; check it out if you’re publishing an event calendar already, or thinking about it. We endorse Jon’s idea of publishing in iCal and related formats.  We’d like to see this go a step further and have each event item include a location, as described above.
Again, the City of Toronto already publishes event data, just not in a machine-readable format. Here’s the Toronto Festivals and Events Calendar, for example http://wx.toronto.ca/festevents.nsf/.
TTC route, stop, and vehicle location information. This sort of data is obviously useful for building all kinds of apps that help people get around the city. Kieran and Kevin have done a great job reverse-engineering TTC route and stop timing data on http://myttc.ca. TTC should provide an official data stream to enable apps like theirs. The data is already in PDF format as route schedules.
There’s lots more on my list, but these are near the top.
What’s on your wish list?

Fall is here, and I’m eagerly awaiting the first release of City of Toronto “open data”. I’ve been thinking about what I’d like to see offered, both for data-hungry citizens in general and, more greedily, for accelerating our progress on 5 Blocks Out. In that spirit, here’s a short wish list.

First, some general “tenet” suggestions:

  • If you’re publishing it for humans, publish it for machines too. We need data in machine-readable open standard formats like JSON, XML, CSV, iCal, and so on. Not just PDF.
  • Publish standard format street addresses, at minimum, for all location-based data. Better yet is a latitude/longitude pair.
  • Publish time-based information such as events in a calendar format such as iCal.
  • Any refreshable dataset needs unique durable IDs for every object in the data set so that machine readers can detect changes over time.
  • Document the data at least enough for people to understand and use it productively. This sounds like a no-brainer, but apparently it has been a blocking issue in use of open data in other cities.

Second, here are a few specific data sets I would find useful for the work I’m doing:

1. Lists of places, including place name and location information.

A foundational part of what we do on 5 Blocks Out involves situating thousands of places on maps. We’re interested in all kinds of places, including businesses, private organizations, and government facilities such as parks, community centres, and libraries. To do this we need trustworthy data sources with at least a name and location for each place.  Ideally the location is already stated as a latitude/longitude, but street addresses also suffice as they can be geocoded into latitude/longitude using various free geocoding web services.

In addition to a place name and location we try to describe each place. For example, if it’s a business or a private organization, what sort of products and services does it provide? If it’s a government facility, what services does it offer to the public? How might one contact this place via phone, email, or fax? Is there a website URL available? And so on. This descriptive info is useful, but not essential.

Lastly, we look for unique identifiers, so that we can tell places apart and identify changes in place information over time. For instance, if a business moves from 123 Main Street to 245 Main Street, how do we know it’s the same business? Some data sources include unique ID information that enable us to detect this sort of change. It turns out that  without unique IDs to rely on you need to come up with funky duplicate detection heuristics.

Here are examples of name-and-location information data sources the City of Toronto publishes today that we would love to have in easily machine-readable format:

- DineSafe lists restaurant names & addresses along with inspection data

- Community centres

- Parks

2. Event data

We’re interested in publishing calendars of events happening throughout the city. We define “event” broadly to mean anything interesting that is time-based… everything from a major street festival, to a sporting event, to a city councillor doing a public consultation meeting. Jon Udell has done some great work on this with his ElmCity project; check it out if you’re publishing an event calendar already, or thinking about it. We endorse Jon’s idea of publishing in iCal and related formats.  We’d like to see this go a step further and have each event item include a location, as described above.

Again, the City of Toronto already publishes some event data, but it’s not in an easily machine-readable format. Here’s the Toronto Festivals and Events Calendar, for example. (Yes, we could build a parser to consume this particular web page, but that would be missing the point.)

3. TTC route, stop, and vehicle location information.

This sort of data is obviously useful for building all kinds of apps that help people get around the city. Kieran and Kevin have done a great job reverse-engineering TTC route and stop timing data on MyTTC.ca. The TTC should provide an official data stream to enable apps like theirs. TTC already offers the data in PDF format as route schedules.

There’s lots more on my list, but these three buckets are near the top.

What’s on your wish list?

→ 4 CommentsCategories: Uncategorized

Open Data: Making Toronto a Better Place to Live

August 14, 2009 · 5 Comments

Several months back Toronto mayor David Miller announced the city would embark on an “Open Data” initiative, with first steps to show by this fall. Well, fall fast approaches, and the city’s Open Data website is still a blank slate. While we don’t know yet what Open Data will be, lots of people have notions of what it ought to be. Here’s mine:

Start with a Sandbox and some Dogfood

It’s been a difficult summer for the city. The strike took a lot of resources offline, including people who would otherwise have been helping formulate and deliver the first batch of data. So while fall is probably still doable, plans for a first release must surely have been scaled back. The first go-round will have to focus on low-hanging fruit: data that happens to be readily available, privacy-clean, politically non-threatening, and already in machine-readable format.

Let’s also recognize that the first release, like any version 1, wants to be a pilot / proof of concept, not a polished product. I imagine the city will publish some sample data feeds (the “dogfood”), encourage people to build a few apps that consume the data (“dogfooding”), and then evolve a repeatable process around that while putting together a viable longer-term plan. That would be just ducky.

Beyond the first release, getting the city into the business of publishing and consuming data is a huge challenge. Technology is the least of the difficulties. It’s a huge prioritization problem, for one… the city needs to develop clear tenets, guidelines, and processes for deciding which requests to bubble to the top of the stack. And there are many “soft” barriers to overcome, including union fears (must automation lead to job losses?), privacy concerns, liability risks, and — probably most difficult — the turf struggles that will surely arise from trying to pry data out of people’s hands.

But this is not a blog post about Fear, Uncertainty, and Doubt. It’s about a happy world where the city overcomes its inertia, rises to the challenge, and Does Great Things. So let’s consider the question of what the data itself should be.

A Framework for Data Selection

If I was running the show, my framework for data selection would look something like this:

  1. Solve real people’s problems: focus on data that real people are requesting in order to solve real world problems. Ignore data that’s “looking for a problem to solve”, even if that data happens to be convenient to obtain and process. In other words, stay customer- and solution-driven, not expediency- and politically-driven.
  2. Satisfy the customers: those “real people” we want to satisfy break down into three groups: citizens, non-government organizations (both for-profit and not-for-profit), and government itself.
  3. Produce net benefit to society: the data’s benefits to society at large should outweigh the data processing costs. Benefit will be hard to measure in some cases. In other cases the benefit will be crystal clear in terms of dollars, e.g. money saved, time saved. Either way I say let’s measure, and get better at measuring, so that we can set goals and quantify our progress over time.
  4. Keep it clean: obviously the data must be OK to release from a privacy perspective, and it shouldn’t expose the city to unreasonable legal risk. That said, I would be perfectly happy with a license that exempted the city from all liability due to things like errors in the data, and I bet a lot of other people and companies would too. After all, we’ve signed a bunch of other licenses just like that for most other online data services we consume, including mission critical services like email and online document storage.
  5. Keep it fresh: the data can (and indeed, must) be refreshed periodically so that it doesn’t go stale. That implies an up-front commitment to continual publishing. Open Data isn’t a one-shot deal.

Open Data = Data In + Data Out

Almost all the examples I’ve read about open data initiatives are “Data Out”, i.e. cities publishing municipal data such as budget and contract details, service records for road repair, traffic flow, and so on, for the general public to consume. This is useful and necessary stuff, but there’s another equally important category I’ll refer to as “Data In”.

Data In is about society at large publishing data which the city consumes. For example, citizens noting the location of major potholes and failed streetlights; community service organizations reporting on how many people they are reaching, and how effectively (an idea Jane Zhang at TechSoup Canada is passionate about); schools reporting student attendance numbers, and so on. There’s a massive amount of “scouting” that can be done by citizens on behalf of the city, in effect crowd-sourcing information to help the city operate more efficiently and decide where to focus its limited resources. Citizens are incented to do it because they want their tax dollars spent efficiently.

“Data In” is the reason I list government itself as one of the key Open Data customers. As part of the planning process the city should be asking each of its departments for their own wish lists of data that society at large could provide in order to help them do their jobs better. Furthermore, those departments should be dogfooding the exact same data services that we the public consume. This process — internal dogfooding, and being your own customer — has a powerful built-in bias towards self-correction and accountability. You can bet the quality of city-published data feeds will be high, for instance, if internal city processes depend on those same feeds.

More to come…

I’ll write more about Open Data in the coming months. I’m selfishly hoping the city will publish some data we find useful for 5 Blocks Out, if only to save us from transcoding information trapped in PDFs (what’s with disabling copy-paste in PDFs?), and from hearing “Sorry, you’ll have to file a Access to Information Request Form for that” when we call our friends at City Hall. We can do better. Much better. Onwards!

→ 5 CommentsCategories: technology and society

MacBook iPhone Tethering Fix

July 13, 2009 · 1 Comment

I recently had some trouble getting my MacBook to connect to the Internet via an iPhone over Bluetooth. It worked fine for a while, then began disconnecting only a second or two after connecting. If you’re having the same problem, read on. If not, skip this post.

Context: My MacBook runs OS 10.5.7, and I’m trying to connect to the Internet via the Bluetooth connection and the “tethering” feature on an iPhone. The phone runs on the Rogers network.

I first went through the standard setup for tethering — turn on tethering, turn on Bluetooth, “pair” your devices — and everything went perfectly. Then, about a day later, things stopped working. The “Connect to Network” entry on the Bluetooth menu became grayed out. The Bluetooth Preferences showed the MacBook/iPhone connection fnctioning briefly when you manually turned the connection on, and then disconnecting after a few seconds. Green light for two seconds, then red light.

I am pretty sure the problem started after I plugged the iPhone in with the USB cable and then accepted prompts to create a new network connection. Everything went pear-shaped from that point on. I tried removing the network connections through the Bluetooth Preferences and Network Preferences UIs, to no avail.

After much digging, here’s a recipe that  solved the problem for me:

[on MacBook] Remove the device pairing: open Bluetooth preferences, select the iPhone entry, click the minus sign.

[on iPhone] Remove the device pairing for your Macbook: Settings, General, Bluetooth, select the device, click ‘Forget this Device’.

[on MacBook] Rename the directory /Library/Preferences/SystemConfiguration to something else, e.g. SystemConfiguration-old.

[on MacBook] Reboot. Mac OS will rebuild the SystemConfiguration for you.

[on MacBook] Run through the standard iPhone tethering setup steps again.

You might want to give it a shot if nothing else works. I assume no liability for any damage you might do.

Other references that might help you:

Good luck. Tethering is great when it works!

→ 1 CommentCategories: technology
Tagged: , , ,

Spread the Word: City of Toronto Launches Urban Fellows Program

April 29, 2009 · 1 Comment

City of Toronto 175 Years

One of the reasons I love living in Toronto at this particular time is the growing energy going into making the city a truly great place to live. There’s an increasing interest amongst everyday citizens in civic issues: topics like housing, transit, streetscapes, art, outdoor life, pollution, and economic vitality are fast becoming part of everyone’s sphere of interest.[1] And just as importantly, there’s an increasing willingness and capacity to change things. Unlike many other cities I’ve visited, Toronto is a place where you can actually change the way the city works, and accomplish it in your lifetime. It’s a huge reason to live here.

If this line of thinking resonates with you, and you’ve been seeking ways to get more engaged within the city, there is a program you need to know about: The City of Toronto is launching the “Urban Fellows Program“, an initiative aimed at attracting new high caliber professionals to the Toronto Public Service.

As I understand it, it’s one half boot-camp, one half incubator for smart people who want to make the city better. Participants get “an intensive introduction to the governance, operations and administration of Canada’s largest city through a combination of full-time work experience and participation in a series of seminars, tours and workshops.”

The program is one year long, with two six-month rotations in city positions. They’re seeking Masters – and Ph.D.-level experience, although that doesn’t seem to be an absolute requirement… I read it as, “we want whip-smart, well-educated people who are fired up about making the city better”. There are a limited number of positions. And it’s paid: the salary is almost $62K, some serious cash.

I love this concept, and I hope they net some really great thinkers. Applications are due may 30, and the first cohort starts this September. Please help spread the word.

[1] I readily admit to being biased by the people I surround myself with.

→ 1 CommentCategories: Uncategorized
Tagged: , ,

Next Post

April 16, 2009 · Comments Off

David Crow posted thoughts on StartupNorth about startup incubators and why we don’t have one in Toronto. As he points out, funding an incubator program is a big challenge. There aren’t enough Angels and VCs around willing to risk money on very early stage ventures here, and the ever-decreasing amounts of capital needed by tech startups look less and less attractive to investors with big chunks of money to manage. So if we’re to have a farm team, in Rick Segal’s words, how do we fund it?

I believe Toronto has both the financial and intellectual capital needed to do this.  Given that we’re having trouble getting bigger investors to fund this sort of effort, I wondered in reply whether micro-financing might be a viable alternative:

What if we tried micro-funding instead of the current approach? That might net enough investors to make it viable. We create a fund that pays for operating one session (or one year) of the program from start to finish. Price shares at, say, $5,000 apiece. Standardize the share terms so there’s no negotiation involved. Entrepreneurs offer up a fixed amount of equity in exchange for program participation. Investors share in the entrepreneurs’ risk and reward.

Who would buy? Well, at that price, I’d buy a share. I bet at least a few hundred other people would too. Wealthy investors (incl. some Angels) might purchase tens or hundreds of shares. Forward-thinking corps and a VC or two looking for higher-risk investments would buy in, and get good PR as a result. Maybe even the government buys some shares, or provides a tax incentive to others for buying. If the terms are suitable, even investors in other countries could participate.

Could we sell 2000 shares at that price? $5,000 x 2000 shares = $10M.

$10M could buy you an awfully big farm team, or even better, many cohorts of a small farm team.

Would you buy a share?

Comments OffCategories: Uncategorized
Tagged: , , ,