Showing posts with label open data. Show all posts
Showing posts with label open data. Show all posts

Sunday, June 27, 2010

Questions I Should Be Able to Ask My City (On the Web)

It's too hard to ask a simple question about a city and get a simple answer.


This notion goes well beyond satisfying the need to know the answers. Our inability to ask basic questions about any city and provide simple answers is a much more profound shortcoming compared to just the answers themselves. Simple answers to basic questions are the foundational elements of more complex analyses.

When we rely on experts and representatives, elected or otherwise, to perform these analyses we expect accuracy and completeness. But how is this measured? No matter what the field of study, the answer has always been references, data sets, and peer review.

Today, especially in matters of public interest, the word "peer" is hard to define. All the more reason why satisfying the first two elements, references and data sets, is of paramount importance.

Cities should provide simple answers to basic questions.

Cities should provide simple answer to basic questions in a URL-addressable, well-formed, text format.

There are many mechanisms cities can use to satisfy this requirement; Web API, online spreadsheet, posted XML documents. Ideally, technology exists and can be integrated so that cities can answer questions via several of these mechanisms.

So what are the questions we would ask our cities, if we could? Following are a few to which I would like to know the answers. Moreover, given the answers to these questions in URL-addressable, well-formed, text format(s) the answers to these questions could make for some rather interesting, useful mash-ups.
  1. Where is the location of each road and intersection under construction in my city?
  2. How long has road/intersection X been under construction?
  3. When will road/intersection X construction project be complete?
  4. What is the list of each building construction permit in my city?
  5. Where is the location of each building construction permit in my city?
  6. What is the list of businesses that pay taxes in my city?
  7. What is the list of businesses that pay taxes in my city, by type of business?
  8. What is the list of how much each business paid in taxes in my city?
  9. Where is the location of each business that pays taxes in my city?
  10. What is the list of neighborhoods in my city?
  11. How many houses are in each neighborhood in my city?
  12. What is the list of schools in the city? (This should include private and church schools, too, if these schools are required to be registered with the city.)
  13. What is the location of each school in my city?
  14. What is the enrollment at each school in my city?
  15. How many cars are registered in my city?
  16. What is the list of most traveled streets in my city?
  17. What is the list of traffic accidents in my city?
  18. Where are the locations of each traffic accident in my city?
  19. How many pounds of garbage are collected in my city?
  20. How many pounds of garbage are collected in my city, by zipcode?
  21. How many pounds of garbage are collected in my city, by neighborhood?
  22. What is the list of events in my city?
  23. When are events occurring in my city, by time period?
  24. Where are events occurring in my city?
  25. What is the list of conditional use permits in my city?
  26. What is the list of all emergency response stations (fire, police, medical) in my city?
  27. Where are the locations of each emergency response station in my city?
  28. What is the number of emergency calls in my city, by time period?
  29. What is the list of stations responding to emergency calls, by time period?
  30. What is the frequency of emergency calls by station and time period?
[To be continued...]

Without a doubt, this list is far from exhaustive. Crime data is one of the most popular and reported-on data sets. I tried to come up with relevant questions for which data are not likely accessible. Certainly, schema are required for declaring the answers to these questions in our required format. Probably every question does not have to be anticipated a priori. We should also not assume that a particular question implies a specific data set. We often can easily get the answer of, for example, the location of each crime. Rarely, however, can we get aggregate data, useful in trend analyses. Perhaps what we need are structured data sets and Wolfram Alpha. In any case, a foundational platform of data is required. In any case, we should be able to formulate a basic question and get a simple answer (on the Web).

Given the simplicity of what is being asked, however, this should not be complicated.

Saturday, November 22, 2008

Priorities for Technology and Innovation?

If I could choose our country's top five strategic priorities for technology and innovation, they would be:

  1. Diversify the energy ecosystem
  2. Transform human transportation
  3. Open more government data
  4. Boost resurgence in manufacturing
  5. Transform farming practices
Diversifying the energy ecosystem means that we diversify production and consumption.  Diversifying production means that we increase the number of different means for producing energy in terms of both type and scale; everything from storing the energy you produce in your exercise machine up to expanding nuclear.  If they ever figure out the photosynthesis model of using the sun then we're literally set for life.  Diversifying energy consumption means that we increase the number of different kinds of energy a consumer can use and decrease the amount of energy the consumer requires in order to be effective.  There will always be both high energy and low energy systems that have purpose and value.  Efficiency of consumption still matters.   Ideally we will produce and require less energy to do more.  This is not a pipe dream.  This is innovation and efficiency.

Transforming human transportation means that we change the way we think about how people move around geographically.  We need to look at the logistics of national and local transportation free from the constraints of mode.  When it comes to moving people, we will also place great emphasis on the preferences of the individual.  These might be combined into the notion of demand.  Given an understanding of demand (past, present, future?) we can make better decisions among biomotive, automotive, locomotive, hydromotive, and aeromotive modes of transportation (most of which are not English words, by the way).  I'm not sure I have good answers, but I'm certain transportation is A) complex and B) broken.  That usually tells me we need to turn the problem on its head and look at it sideways to recognize what will work.

Opening more goverment data means that local, state, and federal governments and their agencies commit to putting all information already in the public domain online in standard, ubiquitous formats that are easy to publish, syndicate, transform, integrate, and view.  Governments should spend less time and money on applications (analysis and visualization tools, websites/portals).  Governments should spend more time and overall less money on opening up our data vice building applications.  Naturally, government-internal uses of data will require tools and applications.  But in all cases better tools and applications will emerge when data is open.  This may be the easiest and least costly priority to implement.

Boosting resurgence in manufacturing means that we recreate and reinvigorate our strength and ingenuity as skilled manufacturers.  More specifically, we invest in:
  1. High-tech, lightweight manufacturing; tools, devices, sensors, robotics and the like
  2. Transportation manufacturing; planes, trains, and automobiles 
Maybe farming equipment becomes an emerging market (see priority 5).  Obviously auto manufacturing needs a new direction.  Flying is a horrendous and toxic experience on many levels.  And trains are poised for a comeback here in the U.S.  Clearly transportation of goods is important and deserves attention.  But I think transportation of goods is largely solvable while transportation of people seems nearly, but not quite, intractable.

Transforming farming practices means we find the middle ground between niche farming and factory farming.  Niche farming is not efficient or practical and is more costly than Americans will endure.  The factory farming model is failing in the current environment.  There is entirely too much land not being farmed in an effort to control prices and trade, yet prices are high and trade is poor.  Some argue that cheap food is gone.  I'm not so sure.  The reason sited is the high cost of transportation undercutting the (tightly) controlled production practices that created cheap food in the first place.  I agree those practices should end.  But as long as we are purposely not farming land I don't know how we can make accurate determinations about price realism in food.  Here in Virginia and in southerly neighbor North Carolina we have acres and acres of underutilized, fertile farmland.  Whether it is land that goes unused because a mid-sized farmer can't compete in a factory-farm environment or old tobacco farms that must transform or die, a new strategy for farming can make use of that land, uplift local economies, keep choices up and prices low.

Then again, what do I know?  This is more feeling than fact.  I could be terribly, terribly wrong.  Clearly I'm short of specifics.  What I'm after is strategic priorities.  This feels like what matters.