Research :: Off the Top :: vanderwal.net

Off the Top: Research Entries

Showing posts: 1-15 of 35 total posts

6 June 2026

Personal Blog Data Analysis - Looking at 25 Years

After adding sparklines to my category lists (Updated Categories with Sparklines and Search is Now in Production) I wanted to have a deeper dive looking at my categories and blog analytics over 25 years.

Category Long Tail

I done a very quick capture of category usage to look at the distribution of use. A question from James about whether my category distribution looked like a long tail distribution and I thought it may, but also looking at the numbers and not having a visualization I wasn’t sure. Charting the use, it really was a very long tail / power law distribution.

this chart is described in the test that follows

I shared it with James and he also ran his and ended up with much the same (Is there a power law of category use? - James’ Coffee Blog). There have been a few discussions of late around category use and some lean into having just a few categories. I have just over 200 categories now as most of my blog post have more than one subject and I use the categories to have an way to jump to related posts that cover the same subject. When I built my site’s CMS I wanted to have the capability to have multiple categories on each post. I have multiple categories for my own purposes, but also I’m cognizant that readers may have other terms.

With the long tail use of categories I know readers may stumble across a post through web search or a link from else where and having a category term that is familiar can get them to other things I may have posted. I view the web as being able to connect with others and blog posts are sharing things I have interests in or curiosity around and being able to connect with others in a similar mindset is the aim. So a handful of categories, particularly across 25 years and over 2,100 posts, doesn’t help build those connections.

Helpful as a Good First Pass

This analysis and data visualizations were helpful to see into my 25 years of posts. There are some analysis sets and data visualizations that need more work. Most of these are more helpful with Plotly in Jupyter and the ability to interact with the visualizations.

I am really curious with what this will look like when I look at Twitter usage and notes. Obsidian on top of my notes make note making easier and far more helpful with backlinks / wiki links. I started using it on top of my directory with notes in June 2020 that had around 2k notes in it going back to 2003. Now there are around 6k to 7k and in the past about half of these notes would have been on one of my blogs.

Posted at 12:13 AM from Bethesda, Maryland.
Marked as :: Blog :: Browsing Structure :: Data Analysis :: Information Architecture :: PIM / PKM :: Research :: v/d Wal Net Site Development :: in Weblog
[perma link for: Personal Blog Data Analysis - Looking at 25 Years]
No Comments

3 September 2022

Weeknote - 3 September 2022

You are asking, “Where are you? Are you okay? Are you still blogging?”

In TikTok parlance, “Great questions. Let me tell you.” First, this standard TikTok pattern is one I find really interesting. It fills in he politeness / nicety gap that has become common in the last decade or two, where people jump into answering questions. This nod to thanking the person asking encourages questions and puts people at ease who asked a question (speaking up is often not something most people are comfortable with). But, the pattern has been used so much and is just a common / required custom, it starts to come off as forced or canned, much like required legal disclaimers. None-the-less, it is a good practice.

Well this was a long “week” (parts of this were in an end of March weeknote that needed finishing, so now edited and updated). Things on the work front got incredibly busy and hectic. I’m going to treat this “weeknote” as a catch-up of things that have held my attention over the past year.

I’m hoping to get back to posting regular weeknotes and blogging. My other blog Personal Infocloud has been quite for a long time, but been waiting for about 2 years for SquareSpace to fix a defect that impacted styling and showing full posts. I have a lot of older content I’ve long used in presentations and workshops, that I’m working to turn into videos of some of the pieces of them that are clearer for understanding in video / animated form. I’m also back working on the 70 plus set of social / complexity lenses I’ve been working on for around 14 years with that label, but around 20 years all together going back to the Model of Attraction (still a foundation for a lot of thinking and framing).

With my son off to college, I may have a little more time to write and share. I’m also looking at a digital garden model (see the last section) and as of recently Massive Wiki for a collaborative or commons approach of moving the Lenses forward (well outward).

Note Taking

I have been deep into cataloging, reading, and using the heck out of Obsidian since trying it out in June of 2020 and going all in at the end of July 2020 and it is now second nature. But, this built on my 10 or so years of taking markdown notes in a directory, which I had 8 to 10 year of text notes in that same directory (which were bulk renamed to markdown). My approach and use of aliases and front matter have changed how I do things, but more on that in the Productivity section below. Many of my issues in a quick test of Roam proved to save me from that path and set of problems, Notion not being mine and not a standard file format so I can reuse the notes easily has stopped being used, I use DevonThink but its backlinking and attempt at other Obsidian functionality was clumsy in my source archive (and I just pull in my directory that Obsidian sits on top of so search is relevant with resources saved), and with Obsidian now having iOS capability I’m really using it a lot on the go. I have a seriously strong preference for having the notes be separate from a system that wrangles and provides organizing for and around them. Having used nvAlt for nearly 10 years when it broke badly and wouldn’t open, all of the 2,500 or so markdown (and text) notes that sat under the app in a directory (and linked with file metadata tags). Putting Obsidian in the same notes directory and crating that directory as a vault things just continued on, but now with far more functionality.

One irony is my use of Obsidian, and in particular my daily notes (Daily Dump), has me posting and sharing here less. It is ironic as I write in the Daily Dump as if I am writing to others, but the notes are just to myself (for now - this may change if I can sort out how to keep some of the reading, learning, observations, etc. separate from work or formative observations. The Daily Dump was partly intended to capture things that could be shared back out in a weeknote. Things like the Personal Operating System, which I found insanely insightful (read below), things from Sentiers and The Near Future Lab (particularly around Generalists, which I find quite similar to a bumping into a brightline for polymaths, but also bumps up against Jane McConnell’s book The Gig Mindset Advantage (more on these later as well).

With Obsidian having tags (used to aggregate related things, as a hook metaphor I’ve used for 18 years or more) and the backlinks to use as bridges to move to related materials and ideas much in the way any hypertext environment functions. I used VooDoo Pad on my Macs for roughly 15 years, but it not easily working across iPad and phone to easily read, edit, or add to the corpus had it shift out of my main workflow. I also use Drafts for quick input from mobile and sometimes iPad.

Obsidian has been amazing with its pace and quality of development over the last 2 years. The iPad version is pretty solid, but I’m usually in reach of my iPad so I’m not leaning on it all that much at the moment. This past week there were large changes to the insider build for 0.16 (it was reworking some underpinnings to improve many of community built plug-ins, themes, and templates), but it was the first time the updates broke things in my workflow rather badly. Normally updates cause no problems, but only offer benefits and improvements, with occasional bumps that are resolved in 5 to 10 minutes. But, with the bump this week, I still love it for thinking through writing and note capturing and interlinking. There isn’t anything out there that is close to it.

Read

I have been reading a lot, with a good portion coming through me trusty RSS feed reader, NetNewswire, which echoes my vanderwal.net links page.

Newsletters

While I am not a huge fan of newletters (mostly the part that they arrive in email and not RSS, but the ones that also have RSS feeds are the ones I have been sticking to). Many of these arrive on Sunday, but I really wish it were Friday night or early Saturday morning so I have Saturday and Sunday mornings to get through them and follow the links and devour what is there, but Sunday mornings many arrive and I spend the week going through them.

The one that I am a huge fan of from a general purpose is Patrick Tanguay’s own newsletter Sentiers, which I find to be a real gem. I lost track of Patrick for a while after his Alpine review stopped publishing. I would love to see his Sentiers grow to be a bit more as I find it to be such a good offering. I have been reading it regularly for about a year or a bit more and Patrick has been popping up on podcasts I follow (Near Future Lab (now mostly moved to a Discord for and The Informed Life - more on these later).

Jorge Arango’s newsletter, Informa(c)tion, is an information and organization focussed gem that arrives every other Sunday. There are always good pieces and the links are gems.

Others I really enjoy and tend to link to things that open more browser tabs are: The Marganilian by Maria Popova; Curtis McHale’s PKM newsletter; and Monocle Weekend Edition newsletter (there are many times of late were the newsletter is a bit off target, but the balance for me mostly entertainment).

Books

The Gig Mindset Advantage has been a gem, mostly as it is very familiar as it is pretty much my natural (unintended) MO (modus operendi).

The Map of Knowledge, by Violet Moller has become one of my favorite books. It quickly turned into a slow meditative read as it broke some of my prior understanding of the world of knowledge and creation of advanced math, sciences, and philosophy. This refactoring of my understanding was around the realization that most of the “great books” and works that are the foundations are just very tiny slivers of knowledge that made it through an insanely fragile process of keeping paper copies. A book would need to be hand-written to create a copy and that copy on paper would only last 50 to 80 years before it would heavily decay. I knew well that most of what Western Europe used as fodder in the Renaissance for advancement were works from ancient Greece that had been kept alive through the great libraries and education systems in Persia, Near East, Middle East, Arabian regions, North Africa, and Moorish efforts in Spain. The realization that were may not be looking at the best thinking from the “classics”, but just those that made it through time.

While I had a decent understanding of the vast contributions advancing math, sciences, medicine, and philosophy that are the foundations of much of western thinking, I didn’t know much of the who, when, and where. The Map of Knowledge goes into these areas with a very good level of understanding. The book also does a great job laying out the cycle of life for advanced learning and libraries in each of the regions and progressions through time. One of the common cycles that causes the downfall in many regions was gap in the civilization between those with advanced knowledge and learning and those in power, as well as regular poeple. That chasm between the advanced and those not caused a lot of friction, most often leading to the destruction of libraries and institutions. Many of the civilizations never returned to anything close to the advancements. But, the libraries and learning institutions dispersed and found new benefactors and locations to continue moving forward.

Violet Moller has certain given me a good foundation to learn more.

Gillian Tett’s Anthro-Vision was a book, well a chapter in that book, I’ve been waiting for for many years. The chapter on “Financial Crisis” where Tett had been researching financial markets using her background as an anthropologist for a new role at The Financial Times. Tett followed the paths of understanding in the way a good ethnographer / anthropologist does looking to understand the quiet, and seemingly foundational, areas that seem to be out of focus. This area was that of credit swaps in financial loan markets, which were what caused the 2007 housing market collapse and in 2008 at the massive meltdown of the financial markets. The model for building understanding is one that should be common, but sadly isn’t. The remainder of the book is quite good as well.

Productivity

The biggest thing around productivity is my use of Obsidian continues, as mentioned above. In a couple chats recently I have found other have brought up the backlinking / crosslinking as the most valuable feature. For those of us who have been using Macs for a while we find it reminiscent of not only wikis and their power, but in particular VooDoo Pad, which was light weight and everything was easily interlinked and backlinked and search was incredibly good. VooDoo Pad ran locally on your Mac (eventually it also could sync and run on iOS devices, but it needed a special application to run it). The genius bit about Obsidian is it is just markdown notes with an app that acts as an over watcher to connect and index things, but leaves the markdown notes fully usable by any other app or service that can use Markdown.

Having been taking notes in one directory (and its sub-directories) for some 20 years the ability to always get to my notes and use them is highly valuable. I have run through numerous other apps (particularly cloud based) that just die or go away as they are no longer popular or the owners have the wonderfully tragic combination of being ignorant and arrogant. I can pick-up any of my notes in markdown that have backlinks and they also function in Drafts or other programs. The principle of Small Apps Loosely Joined still has resonance and deep value.

Along with Obsidian and the backlinked notes, I have also been keeping a keen eye on Digital Gardening (Maggie Appleton explains this really well and had links to others also diving deeply). At some point I also stumbled upon Software for your second brain - The Stack Overflow Podcast with Alexander Obenauer talking about his quest for creation of a “personal operating system”, which he shares out in his Lab Notes. Much of Alexander’s quest became refocussed on Obsidian as it was doing a lot of what he needed and was trying to frame out so to build it. He has crated extensions to Obsidian to close some of his perceived gaps, but the underlying principle is data portability and a concept incredibly close to the Small Apps Loosely Joined.

Posted at 6:58 PM from Bethesda, Maryland.
Marked as :: Books :: Complexity :: Complexity Lenses :: History :: Information Aggregation :: Knowledge Management :: Library Science :: Obsidian :: Personal :: Philosophy :: Productivity :: Research :: Social Lenses :: Social Software :: Weeknote :: Workflow :: in Weblog
[perma link for: Weeknote - 3 September 2022]
No Comments

3 October 2020

Rebuilding My Note Taking and Management System and Model

The past many weeks I have been digging into a better note taking and management method, while also embracing what I have and my core underlying principles. A continual genre in YouTube I watch is around productivity, particularly around personal knowledge management methods and tools. A couple years back I ran into Zettelkasten Method, that comes from Niklas Luhmann, which focuses on his prolific reading and his card catalogue and related note taking system. Then a few months back I heard Jorge Arango’s interview with Beck Tench it drew Zettelkasten back into focus. The interview with Beck focussed on Tinderbox, which I love, but I also want mobile access to my notes from phone and tablet.

Early Exploration

I have been using Notion a little bit, but my only use the last few months is as an interstitial capture for YouTube and some other rich media. [I like Notion and it seems like a modern take on Podio and has a similar downfall of not sorting out an adaptive data structure for interoperability and consistency.] But, the communities that are interested in Notion became obsessed with Roam Research, so I looked at Roam. Roam and Notion are two vastly different approaches, which can complement each other but in to way replace each other. But, each has a similar faults, no API, no standard export for structured information, and fully cloud based. That is too many common failure points wrapped into one product (Notion is working on and API, which is really good). Roam bugged me most because it relies on an outline format but has no clue about OPML exporting, but worse has no good export model. The cloud based, which requires being connected and online is a model I really don’t like as, particularly if their isn’t a local sync nor standard data format model. What I really like about Roam is its block focussed format, that is akin to purple numbers model of small chunks that are addressable and reusable.

In this time of looking what a next generation of quick note taking would look like, but long used tool, NValt failed spectacularly, in that it would not find my directory where my 1,200+ notes were stored, nor could I add new notes. Fortunately all of my notes are in plain markdown text files, so all I was missing was my tagging of the files in NValt (Brett Terpstra who created NValt has been working on a new tool that can replace NValt but has been taking forever to show up and my need became immediate). This is one of the common reasons for owning my own notes and having them locally and not using somebody else’s model and framework. But, also using the [small apps loosely joined] model where many tools pointing at well formatted / structured data / information can function to their best ability and can use their strengths without breaking anything with the information / data.

Seriously Looking at Note Taking and Management Tools

I started looking at about five or six different note taking tools. I was building out a rough attribute model of tools to help see what each offered or didn’t. I am needing to write this up, but it started with watching Mike and Matty’s, Notion vs Roam vs Obsidian vs Remnote - How to best fit note taking app for you and using their criteria as a base, then building on it. Obsidian and Remnote were already on my list, but also included Zettelnote, Zettlr, and a couple that extended Tidlywiki for a Zettelkasten type model. I also included OmniOutliner as that has been (and will be) my core outlining tool that interplays well with OPML and I can back and forth with good mind mapping tools that also output and import OPML data standard. I also included DevonThink Pro as it is my long used (since 2005) note and information storage and smart search tool (it already was indexing my notes directories) that there is no chance I’m going to give up, but also knew it didn’t have the core functionality I was seeking, wiki-style back linking.

I did a quick test or Roam and ruled it out as it broke rules I try not to break, and it broke many of them (biggest one is know now you are going to exit before you enter anything and a lack of any structure nor API made it a giant risk I’ve been burned by too many times, but the developers have a lot of arrogance about their approach that far too often leads to disasters - sometimes the kindest, smartest, and solid planning people end up with disasters that I feel very badly about but arrogance and ignorant I don’t).

Zettlr and Remnote were next. But the setup took a bit more of me managing and building things and I know when I lose focus those may not be best choices for myself (my past self 15 years ago or more would have loved it and done well with it, but those days are not now).

Obsidian Ticks the Right Boxes and Adapts to My Existing Model

Obsidian is where I put some time. I pointed its “Vault” to my notes directory (and sub-directory) where I had my 1,200 markdown notes already (some of them were .txt extensions, which I did bulk extension swap on) and it could read everything perfectly. One of my first tests was adding backlinks to some of my social lenses and social scaling notes, which worked really well by making related elements connected. I started capturing my notes about what I was doing in Obsidian and the ease of not only connecting things with backlinks, but having the ability to set empty node wiki links (many notes with the same link to a note / page that doesn’t exist yet, but have the same link to it) and then being able to use backlink following from that non-existent notes link list of things pointing to it was insanely valuable.

I have quite a few book list and book note pages already and I started linking them and linking authors and making author pages. I also found I was wanting note page templates for simple book pages in a Zettelkasten model, a book notes template, author / creator template, and a few others. I created these from existing structured notes I’ve used for years and put the outlines in TextExpander using a simple input line or two to label all of the headers with author name or other name.

I started typing out my notes and highlights from books I’ve read and annotated over the years and after the first three or so books I was deeply hooked.

The Use Where Obsidian Showed I was Hooked

Where I knew I was sold was this last weekend I went back to one of Matt Webb’s blog posts on Small Groups that is dense and has links out to great resources. I captured my initial notes on Matt’s post, and annotated relating to his sections. But, I also quickly dug through the linked materials and created and filled out structured note pages for those as well. The James Mullholland post on Small Groups was fantastic and it spidered out to more related resources, so I followed those and took notes. All of this was cross-linked and back-linked and fleshed out small group notes that I have been building as part of social scaling I’ve been writing on and presenting (talks and workshops) for years. The small group size they focus on is roughly team size, but not a team. Both of these are cooperative social models, which scale from teams, groups (small to large groups with similar social interaction models, but the dynamics shift quite a bit around 75 people and break fully about 300 to 500 people), community (everybody inside a firewall or inside an walled off construct), and network (inside and outside a firewall - so for business it is customers, contractors, consultants, vendors, etc. where there needs to be a safe model for sharing information with shared goals as different roles with their purpose come together for back and forth exchange) - more can be found in my related write-up 5 Core Insights for Community Platforms Today.

This note taking and contextualizing and cross linking to rip through and gut a series of related and interrelated pieces has been something I’ve long looked for and wanted. Many dog years ago in college I took reading notes on note cards with citations and context. When writing a paper / essay I would assemble the note cards in an order that could tell a story. Then I would build an outline in WordStar and type in the quotes. Then I would write the narrative and wrapper. Obsidian is starting to get at that, but ripping through a resource to pull out highlights, quotes, annotations, and notes is utterly fantastic. It gives me a solid resource to easily pull together ideas and supporting information.

Other Obsidian Capabilities

Obsidian can show two note pages at once so to easily copy book citation information from the structured book note file into the book note page. The multiple notes in panels also works well for copying quotes to quote pages and cross linking.

Using Obsidian and Still Working from Mobile and Tablet

The mobile use essential had been broken for a bit after Dropbox stopped supporting softlinks in Mac and requiring that to be native in Dropbox and doing the softlink from the Mac to Dropbox. I moved the directory to Dropbox, which leaves a copy locally usable should something happen to Dropbox and added a softlink for local backups. I pointed DevonThink to this directory to index and I was back running. Now I can use Drafts to take a quick note from my iOS devices and push it to the notes directory (later go back and fix the file name) and I have good inbound notes and can use backlinks (which I test later). This method also works for share sheet to Drafts from Overcast or YouTube and having the link to the media and the notes all pulled in.

Happiness with notes has been missing for a while, perhaps happiness has returned.

Resources

Posted at 6:59 PM from Bethesda, Maryland.
Marked as :: Collaboration :: Contextual Design :: Folksonomy :: InfoCloud :: Information Aggregation :: Information Architecture :: Knowledge Management :: Note Taking :: Obsidian :: PIM / PKM :: Personal :: Productivity :: Reference :: Research :: Resource :: Social Software :: Software :: Technology :: in Weblog
[perma link for: Rebuilding My Note Taking and Management System and Model]
No Comments

13 March 2016

PubPub from MIT Media Lab

I just stumbled into MIT Media Lab’s PubPub service that is an open platform for writing new research journals. It has some nice collaborative features, versioning, embraces markdown at its core, and inline discussions.

Social Circles

One of the pieces I really want to explore is how its social dimensions work. My my take on different things I write and have interest in have different social circles that I want to ping and get feedback from. But, there are subjects and groups / communities that I really would like to participate in as well. This is a more complicated and complex area that really needs work and focus. Google+ tried this but deeply flubbed it as their circles are based on individual’s perspectives and not socially constructed realities (knowing the boundaries of who is involved in a circle and having really solid social interaction design around that is a basic requirement, something nobody at Google seemed to consider nor have basic foundations in social sciences to understand this basic need). For PubPub, getting these constructs right would be really helpful and make it a really powerful service and platform.

Other than PubPub

PubPub is fairly close to what I was hoping Poetica would become and Draft app. I’ve been thinking about this for The Lenses and its subset, Social Lenses writings. Also just being able to write blogs and get knowledgeable sanity checks on them from others before posting. This is something I was trying to do with Draft app and had some success with people who are familiar with markdown (which is most people I interact with), but alerting people or subsets of groups that there is something I would really like early looks at and feedback is where it falls a bit short. It also seems like Nate Kontny is now more focussed on Highrise (light CRM service that he took over and now is CEO) than Draft. Also with the purchase of Poetica and its imminent shuttering, I’m looking at other options.

Some of what I have interest in can be found in Medium, but I’d rather just syndicate there, given their use policy. Medium is a really nice content creation platform, with some okay drafting with feedback capabilities, but I’m looking for a bit more. I also really prefer Markdown approach these days as it keeps things really light and I can edit and work on writing from most any platform I have with me or at my access, even if I’m lacking a network connection (which is something that is really helpful for focus for me actually).

One PubPub Wish

The one thing I wish PubPub had was an open source version where I owned the platform and could run it on a server of my choosing. But, it looks like this is in the plans (a few bugs need to be squashed on their way to this), as the PubPub About page states.

Posted at 4:15 PM from Bethesda, Maryland.
Marked as :: Collaboration :: Content :: Information Application Development :: Knowledge Management :: Research :: Social Software :: Writing :: in Weblog
[perma link for: PubPub from MIT Media Lab]
No Comments

28 September 2010

As If Had Read

The idea of a tag "As If Had Read" started as a riff off of riffs with David Weinberger at Reboot 2008 regarding the "to read" tag that is prevalent in many social bookmarking sites. But, the "as if had read" is not as tongue-in-cheek at the moment, but is a moment of ah ha!

I have been using DevonThink on my Mac for 5 or more years. It is a document, note, web page, and general content catch all that is easily searched. But, it also pulls out relevance to other items that it sees as relevant. The connections it makes are often quite impressive.

My Info Churning Patterns

I have promised for quite a few years that I would write-up how I work through my inbound content. This process changes a lot, but it is back to a settled state again (mostly). Going back 10 years or more I would go through my links page and check all of the links on it (it was 75 to 100 links at that point) to see if there was something new or of interest.

But, that changed to using a feedreader (I used and am back to using Net News Wire on Mac as it has the features I love and it is fast and I can skim 4x to 5x the content I can in Google Reader (interface and design matters)) to pull in 400 or more RSS feeds that I would triage. I would skim the new (bold) titles and skim the content in the reader, if it was of potential interest I open the link into a browser tab in the background and just churn through the skimming of the 1,000 to 1,400 new items each night. Then I would open the browser to read the tabs. At this stage I actually read the content and if part way through it I don't think it has current or future value I close the tab. But, in about 90 minutes I could triage through 1,200 to 1,400 new RSS feed items, get 30 to 70 potential items of value open in tabs in a browser, and get this down to a usual 5 to 12 items of current or future value. Yes, in 90 minutes (keeping focus to sort the out the chaff is essential). But, from this point I would blog or at least put these items into Delicious and/or Ma.gnolia or Yahoo MyWeb 2.0 (this service was insanely amazing and was years ahead of its time and I will write-up its value).

The volume and tools have changed over time. Today the same number of feeds (approximately 400) turn out 500 to 800 new items each day. I now post less to Delicious and opt for DevonThink for 25 to 40 items each day. I stopped using DevonThink (DT) and opted for Yojimbo and then Together.app as they had tagging and I could add my context (I found my own context had more value than DevonThink's contextual relevance engine). But, when DevonThink added tagging it became an optimal service and I added my archives from Together and now use DT a lot.

Relevance of As if Had Read

But, one of the things I have been finding is I can not only search within the content of items in DT, but I can quickly aggregate related items by tag (work projects, long writing projects, etc.). But, its incredible value is how it has changed my information triage and process. I am now taking those 30 to 40 tabs and doing a more in depth read, but only rarely reading the full content, unless it is current value is high or the content is compelling. I am acting on the content more quickly and putting it into DT. When I need to recall information I use the search to find content and then pull related content closer. I not only have the item I was seeking, but have other related content that adds depth and breath to a subject. My own personal recall of the content is enough to start a search that will find what I was seeking with relative ease. But, were I did a deeper skim read in the past I will now do a deeper read of the prime focus. My augmented recall with the brilliance of DevonThink works just as well as if I had read the content deeply the first time.

Posted at 1:06 PM from Bethesda, Maryland.
Marked as :: Apple/Mac :: Attraction :: Browsers :: Folksonomy :: InfoCloud :: Information Aggregation :: Knowledge Management :: Metadata :: PIM / PKM :: Personal :: RSS :: Reference :: Research :: Resource :: Searching :: Software :: in Weblog
[perma link for: As If Had Read]
No Comments

13 June 2007

Folksonomy Provides 70 Percent More Terms Than Taxonomy

While at the WWW Conference in Banff for the Tagging and Metadata for Social Information Organization Workshop and was chatting with Jennifer Trant about folksonomies validating and identifying gaps in taxonomy. She pointed out that at least 70% of the tags terms people submitted in Steve Museum were not in the taxonomy after cleaning-up the contributions for misspellings and errant terms. The formal paper indicates (linked to in her blog post on the research more steve ... tagger prototype preliminary analysis) the percentage may even be higher, but 70% is a comfortable and conservative number.

Is 70% New Terms from Folksonomy Tagging Normal?

In my discussion with enterprise organizations and other clients that are looking to evaluate their existing tagging services, have been finding 30 percent to nearly 70 percent of the terms used in tagging are not in their taxonomy. One chat with a firm who had just completed updating their taxonomy (second round) for their intranet found the social bookmarking tool on their intranet turned up nearly 45 percent new or unaccounted for terms. This firm knew they were not capturing all possibilities with their taxonomy update, but did not realize their was that large of a gap. In building their taxonomy they had harvested the search terms and had used tools that analyzed all the content on their intranet and offered the terms up. What they found in the folksonomy were common synonyms that were not used in search nor were in their content. They found vernacular, terms that were not official for their organization (sometimes competitors trademarked brand names), emergent terms, and some misunderstandings of what documents were.

In other informal talks these stories are not uncommon. It is not that the taxonomies are poorly done, but vast resources are needed to capture all the variants in traditional ways. A line needs to be drawn somewhere.

Comfort in Not Finding Information

The difference in the taxonomy or other formal categorization structure and what people actually call things (as expressed in bookmarking the item to make it easy to refind the item) is normally above 30 percent. But, what organization is comfortable with that level of inefficiency at the low end? What about 70 percent of an organizations information, documents, and media not being easily found by how people think of it?

I have yet to find any organization, be it enterprise or non-profit that is comfortable with that type of inefficiency on their intranet or internet. The good part is the cost is relatively low for capturing what people actually call things by using a social bookmarking tool or other folksonomy related tool. The analysis and making use of what is found in a folksonomy is the same cost of as building a taxonomy, but a large part of the resource intensive work is done in the folksonomy through data capture. The skills needed to build understanding from a folksonomy will lean a little more on the analytical and quantitative skills side than the traditional taxonomy development. This is due to the volume of information supplied can be orders of magnitude higher than the volume of research using traditional methods.

Posted at 1:27 AM from Bethesda, Maryland.
Marked as :: Attraction :: Folksonomy :: InfoCloud :: Information Aggregation :: Internet :: Intranet :: Knowledge Management :: Metadata :: Museum :: Research :: Searching :: User-Centered Design :: Web apps :: Web Services :: in Weblog
[perma link for: Folksonomy Provides 70 Percent More Terms Than Taxonomy]
No Comments

17 January 2006

Folksonomy Research Needs Cleaning Up

After getting flooded with e-mail yesterday about the Folksonomies: Tidying Up? in the January DLIB 2006 and yes I agree that by using Flickr as a base for much of their analysis they made a mess of their conclusions. Please go see Explaining and Showing Broad and Narrow Folksonomies to begin to get an understanding of why Flickr is not a great example of folksonomy. Showing tag distributions when tagging is limited by the tool (Flickr only permits one of each tag and does not allow identification of the person tagging, unless the API is used) is rather pointless. The central focus of a folksonomy is for personal refindability and derived from that point we get great value.

I would love to see this research redone with a better understanding of folksonomy and run the research on broad folksonomy tools like del.icio.us, Furl, Shadows, etc.

Posted at 2:52 PM from Bethesda, Maryland.
Marked as :: Folksonomy :: Library Science :: Research :: in Weblog
[perma link for: Folksonomy Research Needs Cleaning Up]
No Comments

22 August 2005

Personal, Portable, Pedestrian in My Hands

I am glad to have my my newest book in my hands finally. It is Personal, Portable, Pedestrian: Mobile Phones in Japanese Life edited by Mitzuko Ito, Daisuke Okabe, and Misa Matsuda. Last year's trip to Amsterdam floored me at how far behind we in the U.S. are with mobile (as well as broadband, which was really amazing). In talking with others on my trip they were pointing out how much farther ahead Japan is than Europe. What do I mean farther ahead? The trends in personal usage of mobile devices are two or three years ahead of the U.S. How people interact and use their mobile devices (text, web, information interaction, etc.). I have been watching the trends I read about in magazine articles and heard in conversation flow from other countries and after years bubble up in U.S. culture. My interest in the Personal InfoCloud draws me toward deeper understandings on personal devices around the world.

Ethnographic insights interest me, particularly for cultures and interactions with technology that I can not witness first hand. This book has come highly recommended and having met Mimi this past Spring I really have been looking forward to this book.

Wired has an overview of Personal, Portable, Pedestrian. You may also want to look at the publisher's MIT Press, page for Personal, Portable, Pedestrian.

Posted at 3:25 AM from Bethesda, Maryland.
Marked as :: Book Review :: Mobile :: Research :: in Weblog
[perma link for: Personal, Portable, Pedestrian in My Hands]
No Comments

12 July 2005

The World in Our Hands

SmartMobs announces It is official, there are more cellphones lines than landlines in the U.S.. I was thinking about this in the past couple weeks. We have already started seeing text and data uses tipping our mobile hands (it is about time we started getting to where much of the rest of the globe has already been).

Now if I could just keep my finger on the number of data enabled phones and the lesser number of laptop/desktop internet connections for the globe. Every time I see this number I forget to mark it or grab it.

[Hat tip Anne]

Posted at 12:51 AM from Bethesda, Maryland.
Marked as :: Data Analysis :: Mobile :: Reference :: Research :: Resource :: Telecom :: in Weblog
[perma link for: The World in Our Hands]
No Comments

30 May 2005

Academic Cites for Interested Parties

One of the things that I am still mulling over that came out of the Social Software in the Academy Workshop is the relationship between academic cites and interested parties (non-academics researching, thinking deeply, and writing about a subject). Over the past year I have had some of the work I have posted on my web sites cited in academic papers. These papers have been for general coursework to graduate thesis.

In the academic realm these cites in other's works give credibility and ranking. In the realm of the professional or "interested party" these cites mean little (other than stroking one's ego). These cites do not translate to higher salary, but they may have some relationship to credibility in a subject area.

Another aspect is finding a way to tie into academic work around these subjects. There are often wonderful academic related gatherings (conferences, symposia, etc.) around these subject matters, but these are foreign to the "interested party". There is a chasm between academic and professional world that should be narrowed or at a minimum bridged in a better way. At SSAW there were some projects I found out about that I would love to follow, or even contribute to in some form (advisement, contributor, etc.).

I have a feeling I will be mulling this for some while, and will be writing about it again.

Posted at 4:47 PM from Bethesda, Maryland.
Marked as :: Community :: Folksonomy :: InfoCloud :: Learning :: Reference :: Research :: Social Software :: Web :: Blog :: in Weblog
[perma link for: Academic Cites for Interested Parties]
No Comments

24 May 2005

Wade Roush and 10,000 Brianiacs

I have been following Wade Roush' continuousblog since its inception a few weeks ago. Continuousblog is focussing on the convergence that is finally taking place in the information technology realm. I had a wonderful conversation with Wade last week and have been enjoying watching his 10,000 Brainiacs evolve in 10,000 Brianiacs, Part 1; 10,000 Brainiacs, Part 2; 10,000 Brainiacs, Part 3; and soon to be 10,000 Brianiacs, Part 4.

Wade's concept of "continuous computing" fits quite nicely in line with the Personal InfoCloud as we have access to many different devices throughout our lives (various operating systems, desktops, laptops, PDA, mobile phone, television/dvr, as well as nearly continuous connectivity, etc.). The Personal InfoCloud focusses on designing and developing with the focus on the person and their use of the information as well as the reuse of the information. It is good to see we have one more in the camp that actually sees the future as what is happening to day and sending the wake-up call out that we need to be addressing this now as it is only going become more prevalent.

Posted at 3:08 AM from Bethesda, Maryland.
Marked as :: Attraction :: InfoCloud :: Internet :: Mobile :: OS :: PDA :: Research :: Technology :: User Experience :: Wireless :: in Weblog
[perma link for: Wade Roush and 10,000 Brianiacs]
No Comments

17 June 2004

Malcolm McCullough Lays a Great Foundation with Digital Ground

Today I finished reading the Malcolm McCullough book, Digital Ground. This was one of the most readable books on interaction design by way of examining the impact of pervasive computing on people and places. McCullough is an architect by training and does an excellent job using the architecture role in design and development of the end product.

The following quote in the preface frames the remainder of the book very well:

My claims about architecture are indirect because the design challenge of pervasive computing is more directly a question of interaction design. This growing field studies how people deal with technology - and how people deal with each other, through technology. As a consequence of pervasive computing, interaction design is poised to become one of the main liberal arts of the twenty-first century. I wrote this book because I ran into many people who believe that. If you share this belief, or if you just wonder what interaction design is in the first place, you may find some substance here in this book.

This book was not only interesting to me it was one of the best interaction books I have read. I personally found it better than the Cooper books, only for the reason McCullough gets into mobile and pervasive computing and how that changes interaction design. Including these current interaction modes the role of interaction design changes quite a bit from preparing an interface that is a transaction done solely on a desktop or laptop, to one that must encompass portability and remote usage and the various social implications. I have a lot of frustration with flash-based sites that are only designed for the desktop and are completely worthless on a handheld, which is often where the information is more helpful to me.

McCullough brings in "place" to help frame the differing uses for information and the interaction design that is needed. McCullough includes home and work as the usual first and second places, as well as the third place, which is the social environment. McCullough then brings in a fourth place, "Travel and Transit", which is where many Americans find themselves for an hour or so each day. How do people interact with news, advertisements, directions, entertainment, etc. in this place? How does interaction design change for this fourth place, as many digital information resources seem to think about this mode when designing their sites or applications.

Not only was the main content of Digital Ground informative and well though out, but the end notes are fantastic. The notes and annotations could be a stand alone work of their own, albeit slightly incongruous.

Posted at 10:52 PM from Bethesda, Maryland.
Marked as :: Attraction :: Book Review :: Communication Theory :: Communications :: Contextual Design :: Digital Media :: Information Aggregation :: Information Application Development :: Interaction Design :: Internet :: Intranet :: Knowledge Management :: Mobile :: Networking :: Quote :: Research :: Technology :: User Experience :: User-Centered Design :: Wireless :: in Weblog
[perma link for: Malcolm McCullough Lays a Great Foundation with Digital Ground]
No Comments

5 March 2004

Tools to Manage Information On Your Personal Hard Drive

I have posted my thoughts on Tools to Manage Information On Your Personal Hard Drive for Mac OS X in particular. I have posted this on my Personal Info Cloud site. This is the first piece of content that I am not posting in both places. This may become a trend as I am spending a fair amount of time thinking through ideas related to the Personal Info Cloud in one place. The Personal Info Cloud has an RSS feed and I will be posting notices that new info has been added there as it happens.

Posted at 12:17 AM from Bethesda, Maryland.
Marked as :: Information Architecture :: InfoCloud :: Research :: in Weblog
[perma link for: Tools to Manage Information On Your Personal Hard Drive]
No Comments

27 January 2004

Project Oxygen Still Alive

Project Oxygen has progressed quite well since we last looked in (Oxygen and Portolano - November 2001). Project Oxygen is a pervasive computing system that is enabled through handhelds. The system has the users information and media follow them on their network and uses hardware (video, speakers, computers, etc.) nearest the user to perform the needed or desired tasks. Project Oxygen also assists communication by setting the language of the voicemail to match the caller's known language. The site includes videos and many details.

Project Oxygen seems to rely on the local network's infrastructure rather than the person's own device. This creates a mix of Personal Info Cloud by using the personal device, but relies on the Local Info Cloud using the local network to extract information. The network also assists to find hardware and external media, but the user does not seem to have control over the information they have found. The user's own organization of the information is important for them so it is associated and categorized in a manner that is easy for them to recall and then reuse. When the user drifts away from the local network is their access to the information lost?

This project does seem to get an incredible amount of pervasive computing right. It would be great to work in an environment that was Project Oxygen enabled.

Posted at 5:45 PM from Bethesda, Maryland.
Marked as :: Collaboration :: Contextual Design :: InfoCloud :: Information Application Development :: Interaction Design :: Intranet :: Knowledge Management :: Mobile :: Networking :: P2P :: Research :: Technology :: User Experience :: User-Centered Design :: in Weblog
[perma link for: Project Oxygen Still Alive]
No Comments

23 January 2004

Keeping the Found Things Found

This weeks New York Times Circuits article: Now Where Was I? New Ways to Revisit Web Sites, which covers the Keep the Found Things Found research project at University of Washington. The program is summarized:

The classic problem of information retrieval, simply put, is to help people find the relatively small number of things they are looking for (books, articles, web pages, CDs, etc.) from a very large set of possibilities. This classic problem has been studied in many variations and has been addressed through a rich diversity of information retrieval tools and techniques.

This topic is at the heart of the Personal Information Cloud. How does a person keep the information they found attracted to themselves once they found that information. Keeping the found information at hand to use when the case to use the information arises is a regular struggle. The Personal Information Cloud is the rough cloud of information that follows the user. Users have spent much time and effort to draw information they desire close to themselves (Model of Attraction). Once they have the information, is the information in a format that is easy for the user or consumer of the information to use or even reuse.

Posted at 10:53 PM from Bethesda, Maryland.
Marked as :: Attraction :: Information Architecture :: InfoCloud :: Information Aggregation :: Knowledge Management :: Mobile :: Research :: Searching :: User-Centered Design :: Web :: in Weblog
[perma link for: Keeping the Found Things Found]
No Comments

1 2 3 | Next »

Off the Top: Research Entries

Category Long Tail

More Analysis on Blog Posts and Categories

Posts per Month

Post Length Over Time

Median Categories per Post

Distribution of Categories per Post

Combined Timeline for Posts, Length, and median

Seasonal Patterns

Top Category Activity Over Time

To 40 Co-Occuring Category Pairs

Category Co-occurence Network Graph